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RAPID DETECTION AND IDENTIFICATION OF PATHOGENS 



This invention was made with government support under Cooperative 
Agreement 70NANB5H1030 awarded by the Department of Commerce, National 
Institute of Standards and Technology, Advanced Technology Program. The 
Government has certain rights in the invention. 

This is a Continuing Patent Application of U.S. Patent Appln. Ser. No. 
08/520,946, filed August 30, 1995, which is a Continuation-in-Part application of U.S. 
Patent Appln. Ser. No. 08/484,956, filed June 7, 1995, now U.S. Patent No. 5,843,654, 
issued December 1, 1998, which is a Continuation-in-Part application of U.S. Patent 
Appln Ser No. 08/402,601, filed March 9, 1995, now abandoned which is a 
Continuation-in-Part Application of Application Serial No. 08/337,164, filed November 
9, 1994, now abandoned, which is a Continuation-in-Part Application of Application 
Serial No. 08/254,359, filed June 6, 1994, now U.S. Patent No. 5,614,402, issued 
March 25, 1997, which is a Continuation-in-Part Application of Application Serial 
No. 08/073,384, filed June 4, 1993, now U.S. Patent No. 5,541,311, issued June 30, 
1996, which is a Continuation-in-Part Application of Application Serial 
No. 07/986,330, filed December 7, 1992, now U.S. Patent No. 5, 4/2,253. 

FIELD OF THE INVENTION 

The present invention relates to methods and compositions for treating nucleic 
acid, and in particular, methods and compositions for detection and characterization of 
nucleic acid sequences and sequence changes. 

BACKGROUND OF THE INVENTION 

The detection and characterization of specific nucleic acid sequences and 
sequence changes have been utilized to detect the presence of viral or bacterial nucleic 
acid sequences indicative of an infection, the presence of variants or alleles of 
mammalian genes associated with disease and cancers, and the identification of the 
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source of nucleic acids found in forensic samples, as well as in paternity 
determinations. 

Various methods are known in the art which may be used to detect and 
characterize specific nucleic acid sequences and sequence changes. Nonetheless, as 
5 nucleic acid sequence data of the human genome, as well as the genomes of 

pathogenic organisms accumulates, the demand for fast, reliable, cost-effective and 
user-friendly tests for specific sequences continues to grow. Importantly, these tests 
must be able to create a detectable signal from a very low copy number of the 
sequence of interest. The following discussion examines three levels of nucleic acid 
10 detection currently in use: I. Signal Amplification Technology for detection of rare 

sequences; II. Direct Detection Technology for detection of higher copy number 
sequences; and III. Detection of Unknown Sequence Changes for rapid screening of 
sequence changes anywhere within a defined DNA fragment. 

I. Signal Amplification Technology Methods For Amplification 

15 The "Polymerase Chain Reaction" (PCR) comprises the first generation of 

methods for nucleic acid amplification. However, several other methods have been 
developed that employ the same basis of specificity, but create signal by different 
amplification mechanisms. These methods include the "Ligase Chain Reaction" 
(LCR), "Self-Sustained Synthetic Reaction" (3SR/NASBA), and "QP-Replicase" (QP). 



20 Polymerase Chain Reaction (PCR) 

The polymerase chain reaction (PCR), as described in U.S. Patent 
Nos. 4,683,195 and 4,683,202 to Mullis and Mullis et al, describe a method for 
increasing the concentration of a segment of target sequence in a mixture of genomic 
DNA without cloning or purification. This technology provides one approach to the 

25 problems of low target sequence concentration. PCR can be used to directly increase 

the concentration of the target to an easily detectable level. This process for 
amplifying the target sequence involves introducing a molar excess of two 
oligonucleotide primers which are complementary to their respective strands of the 
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double-stranded target sequence to the DNA mixture containing the desired target 
sequence. The mixture is denatured and then allowed to hybridize. Following 
hybridization, the primers are extended with polymerase so as to form complementary 
strands. The steps of denaturation, hybridization, and polymerase extension can be 
repeated as often as needed, in order to obtain relatively high concentrations of a 
segment of the desired target sequence. 

The length of the segment of the desired target sequence is determined by the 
relative positions of the primers with respect to each other, and, therefore, this length 
is a controllable parameter. Because the desired segments of the target sequence 
become the dominant sequences (in terms of concentration) in the mixture, they are 
said to be "PCR-amplified." 



Ligase Chain Reaction (LCR or LAR) 

The ligase chain reaction (LCR; sometimes referred to as "Ligase Amplification 
Reaction" (LAR) described by Barany, Proc. Natl. Acad. Sci., 88:189 (1991); Barany, 
5 PGR Methods and Applic, 1 :5 (1991); and Wu and Wallace, Genomics 4:560 (1989) 

has developed into a well-recognized alternative method for amplifying nucleic acids. 
In LCR, four oligonucleotides, two adjacent oligonucleotides which uniquely hybridize 
to one strand of target DNA, and a complementary set of adjacent oligonucleotides, 
which hybridize to the opposite strand are mixed and DNA ligase is added to the 
20 mixture. Provided that there is complete complementarity at the junction, ligase will 

covalently link each set of hybridized molecules. Importantly, in LCR, two probes are 
ligated together only when they base-pair with sequences in the target sample, without 
gaps or mismatches. Repeated cycles of denaturation, hybridization and ligation 
amplify a short segment of DNA. LCR has also been used in combination with PCR 
25 to achieve enhanced detection of single-base changes. Segev, PCT Public. 

No. W09001069 Al (1990). However, because the four oligonucleotides used in this 
assay can pair to form two short ligatable fragments, there is the potential for the 
generation of target-independent background signal. The use of LCR for mutant 
screening is limited to the examination of specific nucleic acid positions. 
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Self-Sustained Synthetic Reaction (3SR/NASBA) 

The self-sustained sequence replication reaction (3SR) (Guatelli et a,., Proc. 
Na ,l Acad. Sci., 87:1874-1878 [1990], with an erratum at Proc. Natl. Acad. So 
87-7797 [1990]) is a transcription-based in vitro amplification system (Kwok e, al, 
5 Proc Natl. Acad. Sc., 86:1173-1177 [1989]) that can exponentially amphfy RNA 
sequences at a uniform temperature. The amphfied RNA can then be utilized for 
mutation detection (Fahy et a,., PCR Meth. Appl., .-.5-33 [199,]). In th.s method 
oligonucleotide pnmer is used ,0 add a phage RNA polymerase promoter to he 
of the sequence of interest. In a cocktai, of enzymes and substrates that mcludes a 
10 second primer, reverse transcnptase, RNase H, RNA polymerase and „bo-»d 

deoxynbonucleoside triphosphates, the target sequence undergoes repeated rounds 
transcription, cDNA synthesis and second-strand synthesis to amplify the area of 
interest. The use of 3SR to detect mutations ,s Kinetically l.m.ted ,o screemng small 
segments of DNA (e.g., 200-300 base pairs). 

, , Q-Beta (Q/3) RepHcase 

,„ th,s method, a probe whrch recognizes the sequence of interest ,s attached to 
the rephcatable RNA template for QP replicase. A previously identified major 
problem with false positives resulting from the replication of unhybndized probes as 
been addressed through use of a sequence-spectfic l.gation step. However, avadable 
20 thermostable DNA Hgases are not effective on this RNA substrate, so the l.gat.on mus 
be performed by T4 DNA Hgase at low temperatures (37°C). This prevents the use of 
high temperature as a means of aclueving specific,* as in the LCR, the Hgation event 
can be used to detect a mutation a. the junction site, but no. elsewhere. 

Table 1 below, Hs.s some of the features desirable for systems useful m 
25 sensitive nucleic acd d.agnostics, and summaries the ab.lit.es of each of the major 
amplification methods (See also, Landgren, Trends in Genetics 9:199 [1993]). 

A successful diagnostic method must be very specific. A straight-forward 
me thod of controlling the specificity of nucle.c acid hybridization ,s by controllmg the 
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temperature of the reaction. While the 3SR/NASBA, and QP systems are all able to 
generate a large quantity of s.gnal, one or more of the enzymes involved m each 
cannot be used at high temperature (,.,, >55°C). Therefore the reaction temperatures 
cannot be raised to prevent non-specific hybridization of the probes. If probes are 
shortened in order to make ,hem melt more easily at low temperatures, the hkehhood 
of having more than one perfect match in a complex genome mcreases. For these 
reasons PGR and LCR currently dominate the research field m detection technolog.es. 
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The basts of the amplification procedure in the PCR and LCR is the fact that 
the products of one cycle become usable templates in all subsequent cycles, 
20 consequently doubling the population wtth each cycle. The final yield of any such 
doubling system can be expressed as: (1 + X)" = y, where »X» is the mean effictency 
(percent copied in each cycle), V is the number of cycles, and »y» is the overall 
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efficiency, or yield of the reaction (Mullis, PCR Methods Applic, 1:1 [1991]). If 
every copy of a target DNA is utilized as a template in every cycle of a polymerase 
chain reaction, then the mean efficiency is 100%. If 20 cycles of PCR are performed, 
then the yield will be 2» or 1,048,576 copies of the starting material. If the reaction 
conditions reduce the mean efficiency to 85%, then the yield in those 20 cycles will be 
only 1 85 20 or 220,513 copies of the starting material. In other words, a PCR running 
at 85% efficiency will yield only 21% as much final product, compared to a reaction 
running at 100% efficiency. A reaction that is reduced to 50% mean efficiency will 
yield less than 1% of the possible product. 

In practice, routine polymerase chain reactions rarely achieve the theoretical 
maximum yield, and PCRs are usually run for more than 20 cycles to compensate for 
the lower yield. At 50% mean efficiency, it would take 34 cycles to achieve the 
million-fold amplification theoretically possible in 20, and at lower efficiencies, the 
number of cycles required becomes prohibitive. In addition, any background products 
that amplify with a better mean efficiency than the intended target will become the 

dominant products. 

Also, many variables can influence the mean efficiency of PCR, including 
target DNA length and secondary structure, primer length and design, primer and 
dNTP concentrations, and buffer composite, to name but a few. Contammatton of 
the reaction with exogenous DNA (e.g., DNA spilled onto lab surfaces) or cross- 
contamination ,s also a major consrderation. Reaction conditions must be carefully 
optimized for each different primer pair and target sequence, and the process can take 
days even for an experienced investigator. The laboriousness of this process, 
including numerous technical considerations and other factors, presents a sigmficant 
drawback to using PCR in the climcal setting. Indeed, PCR has ye. to penetrate the 
clinical market in a sigmficant way. The same concerns arise with LCR, as LCR must 
also be optimized to use different oligonucleotide sequences for each targe, sequence. 
In addition, both methods require expensive equipment, capable of precise temperature 
cycling. 
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Many applications of nucleic acid detection technologies, such as in studies of 
allelic variation, involve not only detection of a specific sequence in a complex 
background, but also the discrimination between sequences with few, or single, 
nucleotide differences. One method for the detection of allele-specific variants by 
5 PGR is based upon the fact that it is difficult for Tag polymerase to synthesize a DNA 

strand when there is a mismatch between the template strand and the 3' end of the 
primer. An allele-specific variant may be detected by the use of a primer that is 
perfectly matched with only one of the possible alleles; the mismatch to the other 
allele acts to prevent the extension of the primer, thereby preventing the amplification 
10 of that sequence. This method has a substantial limitation in that the base composition 

of the mismatch influences the ability to prevent extension across the mismatch, and 
certain mismatches do not prevent extension or have only a minimal effect (Kwok et 
ai, Nucl. Acids Res., 18:999 [1990]).) 

A similar 3'-mismatch strategy is used with greater effect to prevent ligation in 
15 the LCR (Barany, PCR Meth. Applic, 1:5 [1991]). Any mismatch effectively blocks 

the action of the thermostable ligase, but LCR still has the drawback of 
target-independent background ligation products initiating the amplification. 
Moreover, the combination of PCR with subsequent LCR to identify the nucleotides at 
individual positions is also a clearly cumbersome proposition for the clinical 
20 laboratory. 

II. Direct Detection Technology 

When a sufficient amount of a nucleic acid to be detected is available, there are 
advantages to detecting that sequence directly, instead of making more copies of that 
target, (e.g., as in PCR and LCR). Most notably, a method that does not amplify the 
25 signal exponentially is more amenable to quantitative analysis. Even if the signal is 

enhanced by attaching multiple dyes to a single oligonucleotide, the correlation 
between the final signal intensity and amount of target is direct. Such a system has an 
additional advantage that the products of the reaction will not themselves promote 
further reaction, so contamination of lab surfaces by the products is not as much of a 

- 7 - 




concern Traditional methods of direct detection including Northern and Southern 
blotting and RNase protection assays usually require the use of radioactivity and are 
no, amenable to automation. Recently devised techniques have sough, to eliminate the 
use of radioactivtty and/or tmprove the sensitivity in automatable formats. Two 
5 examples are the "Cycling Probe Reaction" (CPR), and "Branched DNA" (bDNA) 

The cycltng probe reaction (CPR) (Duck e, al, BioTech., 9:142 [1990]), uses a 
long chimeric ol.gonucleotide in which a central portion is made of RNA wh.le the 
two termini are made of DNA. Hybridization of the probe to a target DNA and 
exposure to a thermostable RNase H causes the RNA portion to be digested. Th,s 
10 destabilizes the remaining DNA portions of the duplex, releasing the remainder of the 
probe from the target DNA and allowing another probe molecule to repeat the process. 
The signal, in the form of cleaved probe molecules, accumulates at a lmear rate. 
While the repeating process increases the signal, the RNA portion of the 
oligonucleotide is vulnerable to RNases that may carried through sample preparation. 
15 Branched DNA (bDNA), described by Urdea et al, Gene 61:253-264 (1987), 

involves oligonucleotides with branched structures that allow each individual 
oligonucleotide to carry 35 to 40 labels (e.g., alkaline phosphatase enzymes). Whtle 
this enhances the signal from a hybridization event, signal from non-speciftc btndtng ,s 
similarly increased. 

20 HI. Detection Of Unknown Sequence Changes 

The demand for tests winch allow the detection of specific nucleic acid 
sequences and sequence changes is growing rapidly in clinical diagnostics. As nucleic 
acid sequence data for genes from humans and pathogenic organisms accumulates, the 
demand for fast, cost-effective, and easy-to-use tests for as yet unknown mutations 
25 within specific sequences is rapidly increasing. 

A handful of methods have been devised to scan nucleic acid segments for 
mutations. One option is to determine the entire gene sequence of each test sample 
(e g. a bacterial isolate). For sequences under approximately 600 nucleotides, this 
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may be accomplished using amplified material {e.g., PCR reaction products). This 
avoids the time and expense associated with cloning the segment of interest. However, 
specialized equipment and highly trained personnel are required, and the method is too 
labor-intense and expensive to be practical and effective in the clinical setting. 
5 In view of the difficulties associated with sequencing, a given segment of 

nucleic acid may be characterized on several other levels. At the lowest resolution, the 
slZ e of the molecule can be determined by electrophoresis by comparison to a known 
standard run on the same gel. A more detailed picture of the molecule may be 
achieved by cleavage with combinations of restriction enzymes prior to electrophoresis, 
0 to allow construction of an ordered map. The presence of specific sequences within 

the fragment can be detected by hybridization of a labeled probe, or the precise 
nucleotide sequence can be determined by partial chemical degradation or by primer 
extension in the presence of chain- terminating nucleotide analogs. 

For detection of single-base differences between like sequences, the 
15 requirements of the analysis are often at the highest level of resolution. For cases in 

which the position of the nucleotide in question is known m advance, several methods 
have been developed for examining single base changes without direct sequencing. 
For example, if a mutation of interest happens to fall within a restriction recognition 
sequence, a change in the pattern of digestion can be used as a diagnostic tool {e.g., 
20 restriction fragment length polymorphism [RFLP] analysis). 

Single point mutations have been also detected by the creation or destruction of 
RFLPs. Mutations are detected and localized by the presence and size of the RNA 
fragments generated by cleavage at the mismatches. Single nucleotide mismatches in 
DNA heteroduplexes are also recognized and cleaved by some chemicals, providing an 
25 alternative strategy to detect single base substitutions, generically named the 

"Mismatch Chemical Cleavage" (MCC) (Gogos et al, Nucl. Acids Res., 18:6807-6817 
[1990]). However, this method requires the use of osmium tetroxide and piperidine, 
two highly noxious chemicals which are not suited for use in a clinical laboratory. 
RFLP analysis suffers from low sensitivity and requires a large amount of 
30 sample. When RFLP analysis is used for the detection of point mutations, it is, by its 
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nature, limited to the detection of only those single base changes which fall within a 
restriction sequence of a known restriction endonuclease. Moreover, the majority of 
the available enzymes have 4 to 6 base-pair recognition sequences, and cleave too 
frequently for many large-scale DNA manipulations (Eckstein and Lilley (eds.), 
Nucleic Acids and Molecular Biology, vol. 2, Springer-Verlag, Heidelberg [1988]). 
Thus, it is applicable only in a small fraction of cases, as most mutations do not fall 
within such sites. 

A handful of rare-cutting restriction enzymes with 8 base-pair specificities have 
been isolated and these are widely used in genetic mapping, but these enzymes are few 
in number, are limited to the recognition of G+C-rich sequences, and cleave at sites 
that tend to be highly clustered (Barlow and Lehrach, Trends Genet., 3:167 [1987]). 
Recently, endonucleases encoded by group I introns have been discovered that might 
have greater than 12 base-pair specificity (Perlman and Butow, Science 246:1106 
[1989]), but again, these are few in number. 

If the change is not in a recognition sequence, then allele- specific 
oligonucleotides (ASOs), can be designed to hybridize in proximity to the unknown 
nucleotide, such that a primer extension or ligation event can be used as the indicator 
of a match or a mis-match. Hybridization with radioactively labeled allelic specific 
oligonucleotides (ASO) also has been applied to the detection of specific point 
mutations (Conner et al., Proc. Natl. Acad. Sci., 80:278-282 [1983]). The method is 
based on the differences in the melting temperature of short DNA fragments differing 
by a single nucleotide. Stringent hybridization and washing conditions can 
differentiate between mutant and wild-type alleles. The ASO approach applied to PCR 
products also has been extensively utilized by various researchers to detect and 
characterize point mutations in ras genes (Vogelstein et al, N. Eng. J. Med., 319:525- 
532 [1988]; and Farr et al, Proc. Natl. Acad. Sci., 85:1629-1633 [1988]), and gsp/gip 
oncogenes (Lyons et al., Science 249:655-659 [1990]). Because of the presence of 
various nucleotide changes in multiple positions, the ASO method requires the use of 
many oligonucleotides to cover all possible oncogenic mutations. 
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With either of the techniques described above (i.e., RFLP and ASO), the 
precise location of the suspected mutation must be known in advance of the test. That 
is to say, they are inapplicable when one needs to detect the presence of a mutation of 
an unknown character and position within a gene or sequence of interest. 

Two other methods rely on detecting changes in electrophoretic mobility in 
response to minor sequence changes. One of these methods, termed "Denaturing 
Gradient Gel Electrophoresis" (DGGE) is based on the observation that slightly 
different sequences will display different patterns of local melting when 
electrophoretically resolved on a gradient gel. In this manner, variants can be 
distinguished, as differences in melting properties of homoduplexes versus 
heteroduplexes differing in a single nucleotide can detect the presence of mutations in 
the target sequences because of the corresponding changes in their electrophoretic 
mobilities. The fragments to be analyzed, usually PCR products, are "clamped" at one 
end by a long stretch of G-C base pairs (30-80) to allow complete denaturation of the 
sequence of interest without complete dissociation of the strands. The attachment of a 
GC "clamp" to the DNA fragments increases the fraction of mutations that can be 
recognized by DGGE (Abrams et al., Genomics 7:463-475 [1990]). Attaching a GC 
clamp to one primer is critical to ensure that the amplified sequence has a low 
dissociation temperature (Sheffield et al., Proc. Natl. Acad. Sci., 86:232-236 [1989]; 
and Lerman and Silverstem, Meth. Enzymol., 155:482-501 [1987]). Modifications of 
the technique have been developed, using temperature gradients (Wartell et al, Nucl. 
Acids Res., 18:2699-2701 [1990]), and the method can be also applied to RNA:RNA 
duplexes (Smith et al., Genomics 3:217-223 [1988]). 

Limitations on the utility of DGGE include the requirement that the denaturing 
conditions must be optimized for each type of DNA to be tested. Furthermore, the 
method requires specialized equipment to prepare the gels and maintain the needed 
high temperatures during electrophoresis. The expense associated with the synthesis of 
the clamping tail on one oligonucleotide for each sequence to be tested is also a major 
consideration. In addition, long running times are required for DGGE. The long 
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running time of DGGE was shortened in a modification of DGGE called constant 
denaturant gel electrophoresis (CDGE) (Borrensen et al, Proc. Natl. Acad. Sci. USA 
88:8405 [1991]). CDGE requires that gels be performed under different denaturant 
conditions in order to reach high efficiency for the detection of unknown mutations. 

An technique analogous to DGGE, termed temperature gradient gel 
electrophoresis (TGGE), uses a thermal gradient rather than a chemcial denaturant 
gradient (Scholz, et al, Hum. Mol. Genet. 2:2155 [1993]). TGGE requires the use of 
specialized equipment which can generate a temperature gradient perpendicularly 
oriented relative to the electrical field. TGGE can detect mutations in relatively small 
fragments of DNA therefore scanning of large gene segments requires the use of 
multiple PCR products prior to running the gel. 

Another common method, called "Single- Strand Conformation Polymorphism" 
(SSCP) was developed by Hayashi, Sekya and colleagues (reviewed by Hayashi, PCR 
Meth. Appl., 1:34-38, [1991]) and is based on the observation that single strands of 
nucleic acid can take on characteristic conformations in non-denaturing conditions, and 
these conformations influence electrophoretic mobility. The complementary strands 
assume sufficiently different structures that one strand may be resolved from the other. 
Changes in sequences within the fragment will also change the conformation, 
consequently altering the mobility and allowing this to be used as an assay for 
sequence variations (Orita, et al, Genomics 5:874-879, [1989]). 

The SSCP process involves denaturing a DNA segment {e.g., a PCR product) 
that is labelled on both strands, followed by slow electrophoretic separation on a non- 
denaturing polyacrylamide gel, so that intra-molecular interactions can form and not be 
disturbed during the run. This technique is extremely sensitive to variations in gel 
composition and temperature. A serious limitation of this method is the relative 
difficulty encountered in comparing data generated in different laboratories, under 

apparently similar conditions. 

The dideoxy fingerprinting (ddF) is another technique developed to scan genes 
for the presence of unknown mutations (Liu and Sommer, PCR Methods Appli., 4:97 
[1994]). The ddF technique combines components of Sanger dideoxy sequencing with 
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SSCP. A dideoxy sequencing reaction is performed using one dideoxy terminator and 
then the reaction products are electrophoresised on nondenaturing polyacrylamide gels 
to detect alterations in mobility of the termination segments as in SSCP analysis. 
While ddF is an improvement over SSCP in terms of increased sensitivity, ddF 
5 requires the use of expensive dideoxynucleotides and this technique is still limited to 

the analysis of fragments of the size suitable for SSCP (i.e., fragments of 200-300 
bases for optimal detection of mutations). 

In addition to the above limitations, all of these methods are limited as to the 
size of the nucleic acid fragment that can be analyzed. For the direct sequencing 
10 approach, sequences of greater than 600 base pairs require cloning, with the 

consequent delays and expense of either deletion sub-cloning or primer walking, in 
order to cover the entire fragment. SSCP and DGGE have even more severe size 
limitations. Because of reduced sensitivity to sequence changes, these methods are not 
considered suitable for larger fragments. Although SSCP is reportedly able to detect 
15 90% of single-base substitutions within a 200 base-pair fragment, the detection drops 

to less than 50% for 400 base pair fragments. Similarly, the sensitivity of DGGE 
decreases as the length of the fragment reaches 500 base-pairs. The ddF technique, as 
a combination of direct sequencing and SSCP, is also limited by the relatively small 
size of the DNA that can be screened. 
20 Clearly, there remains a need for a method that is less sensitive to size so that 

entire genes, rather than gene fragments, may be analyzed. Such a tool must also be 
robust, so that data from different labs, generated by researchers of diverse 
backgrounds and skills will be comparable. Ideally, such a method would be 
compatible with "multiplexing," (i.e., the simultaneous analysis of several molecules or 
25 genes in a single reaction or gel lane, usually resolved from each other by differential 

labelling or probing). Such an analytical procedure would facilitate the use of internal 
standards for subsequent analysis and data comparison, and increase the productivity of 
personnel and equipment. The ideal method would also be easily automatable. 

SUMMARY OF THE INVENTION 
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The present invention relates to methods and compositions for treating nucleic 
acid, and in particular, methods and compositions for detection and characterization of 
nucleic acid sequences and sequence changes in microbial gene sequences. The 
present invention provides means for cleaving a nucleic acid cleavage structure in a 
site-specific manner. In one embodiment, the means for cleaving is an enzyme 
capable of cleaving cleavage structures on a nucleic acid substrate, forming the bas 1S 
of a novel method of detection of specific nucleic acid sequences. The present 
invention contemplates use of the novel detection method for, among other uses, 
clinical diagnostic purposes, including but not limited to detection and identification of 

pathogenic organisms. 

In one embodiment, the present invention contemplates a DNA sequence 
encoding a DNA polymerase altered in sequence (i.e., a "mutant" DNA polymerase) 
relative to the native sequence such that it exhibits altered DNA synthetic activity from 
that of the native (i.e., "wild type") DNA polymerase. With regard to the polymerase, 
a complete absence of synthesis is not required; it is desired that cleavage reactions 
occur in the absence of polymerase activity at a level where it interferes with the 
method. It is preferred that the encoded DNA polymerase is altered such that it 
exhibits reduced synthetic activity from that of the native DNA polymerase. In this 
manner, the enzymes of the invention are nucleases and are capable of cleaving nucleic 
acids in a structure-specific manner. Importantly, the nucleases of the present 
invention are capable of cleaving cleavage structures to create discrete cleavage 
products. 

The present invention contemplates nucleases from a variety of sources, 
including nucleases that are thermostable. Thermostable nucleases are contemplated as 
particularly useful, as they are capable of operating at temperatures where nucleic acid 
hybridization is extremely specific, allowing for allele-specific detection (including 
single-base mismatches). In one embodiment, the thermostable 5' nucleases are 
selected from the group consisting of altered polymerases derived from the native 
polymerases of various Thermus species, including, but not limited to Thermus 
aquaticus, Thermus flavus and Thermus thermophilus. 
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The present invention utilizes such enzymes in methods for detection and 
characterization of nucleic acid sequences and sequence changes. The present 
invention relates to means for cleaving a nucleic acid cleavage structure in a site- 
specific manner. Nuclease activity is used to screen for known and unknown 
mutations, including single base changes, in nucleic acids. 

In one embodiment, the present invention contemplates a process or method for 
identifying strains of microorganisms comprising the steps of providing a cleavage 
means and a nucleic acid substrate containing sequences derived from one or more 
microorganism; treating the nucleic acid substrate under conditions such that the 
substrate forms one or more cleavage structures; and reacting the cleavage means with 
the cleavage structures so that one or more cleavage products are produced. In one 
embodiment of this invention, the cleavage means is an enzyme. In one preferred 
embodiment, the enzyme is a nuclease. In an alternative preferred embodiment, the 
nuclease is selected from the group consisting of Cleavase™ BN enzyme, Thermus 
aquations DNA polymerase, Thermus thermophilus DNA polymerase, Escherichia coli 
Exo III, and the Saccharomyces cerevisiae Radl/RadlO complex. It is also 
contemplated that the enzyme may have a portion of its amino acid sequence that is 
homologous to a portion of the amino acid sequence of a thermostable DNA 
polymerase derived from a eubacterial thermophile, the latter being selected from the 
20 group consisting of Thermus aquations, Thermus flavus and Thermus thermophilus. 

It is contemplated that the nucleic acid substrate comprise a nucleotide analog, 
including but not limited to the group comprising 7-deaza-dATP, 7-deaza-dGTP and 
dUTP. In one embodiment, the nucleic acid substrate is substantially single-stranded. 
It is not intended that the nucleic acid substrate be limited to any particular form, 
25 indeed, it is contemplated that the nucleic acid substrate is single stranded or double- 

stranded RNA or DNA. 

In one embodiment of the present invention, the treating step comprises 
rendering double- stranded nucleic acid substantially single-stranded, and exposing the 
single-stranded nucleic acid to conditions such that the single-stranded nucleic acid 
30 assumes a secondary or characteristic folded structure. In one preferred embodiment, 
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double-stranded nucleic acid is rendered substantially single-stranded by increased 
temperature. 

In an alternative embodiment, the method of the present invention further 
comprises the step of detecting one or more cleavage products. 
5 It is contemplated that the microorganism(s) of the present invention be 

selected from a variety of microorganisms. It is not intended that the present invention 
be limited to any particular type of microorganism. Rather, it is intended that the 
present invention be used with organisms including, but not limited to, bacteria, fungi, 
protozoa, ciliates, and viruses. It is not intended that the microorganisms be limited to 
10 a particular genus, species, strain, or serotype. Indeed, it is contemplated that the 

bacteria be selected from the group including, but not limited to members of the 
genera Campylobacter, Escherichia, Mycobacterium, Salmonella, Shigella,^ 
Staphylococcus. In one preferred embodiment, the microorganism(s) comprise strains 
of multi-drug resistant Mycobacterium tuberculosis. It is also contemplated that the 
15 present invention be used with viruses, including but not limited to hepatitis C virus 

and simian immunodeficiency virus. 

Another embodiment of the present invention contemplates a method for 
detecting and identifying strains of microorganisms, comprising the steps of extracting 
nucleic acid from a sample suspected of containing one or more microorganisms; and 
20 contacting the extracted nucleic acid with a cleavage means under conditions such that 

the extracted nucleic acid forms one or more secondary structures, and the cleavage 
means cleaves the secondary structures to produce one or more cleavage products. 

In one embodiment, the method further comprises the step of separating the 
cleavage products. In yet another embodiment, the method further comprises the step 
25 of detecting the cleavage products. 

In one preferred embodiment, the present invention further comprises 
comparing the detected cleavage products generated from cleavage of the extracted 
nucleic acid isolated from the sample with separated cleavage products generated by 
cleavage of nucleic acids derived from one or more reference microorganisms. In such 
30 a case the sequence of the nucleic acids from one or more reference microorganisms 
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may be related but different {e.g., a wild type control for a mutant sequence or a 
known or previously characterized mutant sequence). 

In an alternative preferred embodiment, the present invention further comprises 
the step of isolating a polymorphic locus from the extracted nucleic acid after the 
5 extraction step, so as to generate a nucleic acid substrate, wherein the substrate is 

contacted with the cleavage means. In one embodiment, the isolation of a 
polymorphic locus is accomplished by polymerase chain reaction amplification. In an 
alternate embodiment, the polymerase chain reaction is conducted in the presence of a 
nucleotide analog, including but not limited to the group comprising 7-deaza-dATP, 7- 
10 deaza-dGTP and dUTP. It is contemplated that the polymerase chain reaction 

amplification will employ oligonucleotide primers matching or complementary to 
consensus gene sequences derived from the polymorphic locus. In one embodiment, 
the polymorphic locus comprises a ribosomal RNA gene. In a particularly preferred 
embodiment, the ribosomal RNA gene is a 16S ribosomal RNA gene. 
15 In one embodiment of this method, the cleavage means is an enzyme. In one 

preferred embodiment, the enzyme is a nuclease. In a particularly preferred 
embodiment, the nuclease is selected from the group including, but not limited to 
Cleavase™ BN enzyme, Thermus aquations DNA polymerase, Thermus thermophilus 
DNA polymerase, Escherichia coli Exo III, and the Saccharomyces cerevisiae 
20 Radl/RadlO complex. It is also contemplated that the enzyme may have a portion of 

its amino acid sequence that is homologous to a portion of the amino acid sequence of 
a thermostable DNA polymerase derived from a eubacterial thermophile, the latter 
being selected from the group consisting of Thermus aquaticus, Thermus JIavus and 
Thermus thermophilus. 

25 It is contemplated that the nucleic acid substrate of this method will comprise a 

nucleotide analog, including but not limited to the group comprising 7-deaza-dATP, 7- 
deaza-dGTP and dUTP. In one embodiment, the nucleic acid substrate is substantially 
single-stranded. It is not intended that the nucleic acid substrate be limited to any 
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particular form, indeed, it is contemplated that the nucleic acid substrate is single 
stranded or double-stranded RNA or DNA. 

In another embodiment of the present invention, the treating step of the method 
comprises rendering double-stranded nucleic acid substantially single- stranded, and 
5 exposing the single- stranded nucleic acid to conditions such that the single- stranded 

nucleic acid has secondary structure. In one preferred embodiment, double-stranded 
nucleic acid is rendered substantially single-stranded by increased temperature. 

It is contemplated that the microorganism(s) of the present invention be 
selected from a variety of microorganisms; it is not intended that the present invention 
10 be limited to any particular type of microorganism. Rather, it is intended that the 

present invention will be used with organisms including, but not limited to, bacteria, 
fungi, protozoa, ciliates, and viruses. It is not intended that the microorganisms be 
limited to a particular genus, species, strain, or serotype. Indeed, it is contemplated 
that the bacteria be selected from the group comprising, but not limited to members of 
15 the genera Campylobacter, Escherichia, Mycobacterium, Salmonella, Shigella,^ 

Staphylococcus. In one preferred embodiment, the microorganism(s) comprise strains 
of multi-drug resistant Mycobacterium tuberculosis. It is also contemplated that the 
present invention be used with viruses, including but not limited to hepatitis C virus 
and simian immunodeficiency virus. 
2 0 In yet another embodiment, the present invention contemplates a method for 

treating nucleic acid comprising an oligonucleotide containing microbial gene 
sequences, comprising providing a cleavage means in a solution containing manganese 
and nucleic acid substrate containing microbial gene sequences; treating the nucleic 
acid substrate with increased temperature such that the substrate is substantially single- 
25 stranded; reducing the temperature under conditions such that the single- stranded 

substrate forms one or more cleavage structures; reacting the cleavage means with the 
cleavage structures so that one or more cleavage products are produced; and detecting 
the one or more cleavage products produced by the method. 

The present invention also contemplates a process for creating a record 
30 reference library of genetic fingerprints characteristic (/. e. , diagnostic) of one or more 
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alleles of the various microorganisms, comprising the steps of providing a cleavage 
means and nucleic acid substrate derived from microbial gene sequences; contacting 
the nucleic acid substrate with a cleavage means under conditions such that the 
extracted nucleic acid forms one or more secondary structures and the cleavage means 
5 cleaves the secondary structures, resulting in the generation of multiple cleavage 

products; separating the multiple cleavage products; and maintaining a testable record 
reference of the separated cleavage products. 

By the term "genetic fingerprint" it is meant that changes in the sequence of the 
nucleic acid (e.g., a deletion, insertion or a single point substitution) alter the structures 
10 formed, thus changing the banding pattern (i.e., the "fingerprint" or "bar code") to 

reflect the difference in the sequence, allowing rapid detection and identification of 
variants. 

The methods of the present invention allow for simultaneous analysis of both 
strands (e.g., the sense and antisense strands) and are ideal for high-level multiplexing. 
15 The products produced are amenable to qualitative, quantitative and positional analysis. 

The methods may be automated and may be practiced in solution or in the solid phase 
(e.g., on a solid support). The methods are powerful in that they allow for analysis of 
longer fragments of nucleic acid than current methodologies. 

DESCRIPTION OF THE DRAWINGS 

20 Figure 1 A provides a schematic of one embodiment of the detection method of 

the present invention. 

Figure lB"provides a schematic of a second embodiment of the detection 

f method of the present invention. 

j^V\ \k Figure^! a comparison of the nucleotide structure of the DNAP genes 

^ 25 isolated from Thermus aquaticus (SEQ ID NO:l), Thermus jlavus (SEQ ID NO:2) and 

Thermus thermophilus (SEQ ID NO:3); the consensus sequence (SEQ ID NO:7) is 
shown at the top of each row. 
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Figure 3 jfe a comparison of the amino acid sequence of the DNAP isolated 
from Thermus aquations (SEQ ID NO:4), Thermus flavus (SEQ ID NO:5), and 
Thermus thermophilus(SEQ ID NO:6); the consensus sequence (SEQ ID NO:8) is 

shown at the top of each row. 

Figures 4A-G are a set of diagrams of wild-type and synthesis-deficient 

DNAPTcu? genes. 

Figure 5 A depicts the wild-type Thermus jlavus polymerase gene. 
Figure 5B depicts a synthesis-deficient Thermus flavus polymerase gene. 
Figure 6 depicts a structure which cannot be amplified using DNAPTaq; Figure 
6 shows the hairpin of SEQ ID NO:15 annealed with the primer of SEQ ID NO:17. 

Figure 7 is a ethidium bromide-stained gel demonstrating attempts to amplify a 
bifurcated duplex using either DNAP Tag or DNAPStf (Stoffel). 

Figure 8 is an autoradiogram of a gel analyzing the cleavage of a bifurcated 
duplex by DNAP Tag and lack of cleavage by DNAPStf. 

Figures 9A-B are a set of autoradiograms of gels analyzing cleavage or lack of 
cleavage upon addition of different reaction components and change of incubation 
temperature during attempts to cleave a bifurcated duplex with DNAP Tag. 

Figures 10A-B are an autoradiogram displaying timed cleavage reactions, with 

and without primer. 

Figures 11A-B are a set of autoradiograms of gels demonstrating attempts to 
cleave a bifurcated duplex (with and without primer) with various DNAPs. 

Figures 12A shows the substrates and oligonucleotides used to test the specific 
cleavage of substrate DNAs targeted by pilot oligonucleotides. 

Figure 12B shows an autoradiogram of a gel showing the results of cleavage 
reactions using the substrates and oligonucleotides shown Fig. 12A. 

Figure 13A shows the substrate and oligonucleotide used to test the specific 
cleavage of a substrate RNA targeted by a pilot oligonucleotide. 

Figure 13B shows an autoradiogram of a gel showing the results of a cleavage 
reaction using the substrate and oligonucleotide shown in Fig. 13 A. 
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Figure 14,js a diagram of vector pTTQ18. 
Figure 15 is a diagram of vector pET-3c. 

Figure 16A-E depicts a set of molecules which are suitable substrates for 
cleavage by the 5' nuclease activity of DNAPs. 

Figure 17 is an autoradiogram of a gel showing the results of a cleavage 
reaction run with synthesis-deficient DNAPs. 

Figure 18 is an autoradiogram of a PEI chromatogram resolving the products of 
an assay for synthetic activity in synthesis-deficient DNAPTa? clones. 

Figure 19A depicts the substrate molecule used to test the ability of synthesis- 
deficient DNAPs to cleave short hairpin structures. 

Figure 19B shows an autoradiogram of a gel resolving the products of a 
cleavage reaction run using the substrate shown in Fig. 19A. 

Figure 20A shows the A- and T-hairpin molecules (SEQ ID NOS:23 and 24, 
respectively) used in the trigger/detection assay. 

Figure 20B shows the sequence of the alpha primer SEQ ID NO:25) used in 

the trigger/detection assay. 

Figure 20C shows the structure of the cleaved A- and T-hairpin molecules 
(nucleotides 23-72 of SEQ ID NO:23 and SEQ ID NO:27, respectively). 

Figure 20D depicts the complementarity between the A- and T-hairpin 
molecules (SEQ ID NOS:23 and 24, respectively). 

Figure 21 provides the complete 206-mer duplex sequence (SEQ ID NO:32) 
employed as a substrate for the 5' nucleases of the present invention; SEQ ID 

NOS:33-39 are shown in Figure 23. 

Figures 22A and B show the cleavage of linear nucleic acid substrates (based 
on the 206-mer of Figure 21) by wild type DNAPs and 5' nucleases isolated from 
Thermus aquaticus and Thermus jlavus. 

Figure 23 provides a detailed schematic corresponding to the of one 
embodiment of the detection method of the present invention. 

Figure 24 shows the propagation of cleavage of the linear duplex nucleic acid 
structures of Figure 23 by the 5' nucleases of the present invention. 
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Figure 25 A shows the "nibbling" phenomenon detected with the DNAPs of the 
present invention. 

Figure 25B shows that the "nibbling" of Figure 25A is 5' nucleolytic cleavage 

and not phosphatase cleavage. 
5 Figure 2^emonstrates that the "nibbling" phenomenon is duplex dependent. 

Figure 27 A is a schematic showing how "nibbling" can be employed in a 
^^\p^ detection assay. 

A VV Figure 2tdemonstrates that "nibbling" can be target directed. 

Figure 29 is a schematic showing the CFLP™ method of generating a 
10 characteristic fingerprint from a nucleic acid substrate. 

Figure 30 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run in the presence of either MgCl 2 or MnCl 2 . 

Figure 31 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run on four similarly sized DNA substrates. 

Figure 32 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run using a wild-type and two mutant tyrosinase gene substrates. 

Figure 33 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run using either a wild-type or mutant tyrosinase substrate varying in length 

from 157 nucleotides to 1.587 kb. 

Figure 34 shows an autoradiograph of a gel resolving the products of cleavage 

reactions run in various concentrations of MnCl 2 . 

Figure 35 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run in various concentrations of KC1. 

Figure 36 shows an autoradiograph of a gel resolving the products of cleavage 

25 reactions run for different lengths of time. 

Figure 37 shows an autoradiograph of a gel resolving the products of cleavage 

reactions run at different temperatures. 

Figure 38 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run using different amounts of the Cleavase™ BN enzyme. 
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Figure 39 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run using four different preparations of the DNA substrate. 

Figure 40 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run on either the sense or antisense strand of four different tyrosinase gene 
5 substrates. 

Figure 41 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run on a wild-type P-globin substrate in two different concentrations of KC1 
and at four different temperatures. 

Figure 42 shows an autoradiograph of a gel resolving the products of cleavage 
10 reactions run on two different mutant p-globin substrates in five different 

concentrations of KC1. 

Figure 43 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run on a wild-type and three mutant p-globin substrates. 

Figure 44 shows an autoradiograph of a gel resolving the products of cleavage 

15 reactions run on an RNA substrate. 

Figure 45 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run using either the Cleavase™ BN enzyme or Tag DNA polymerase as the 
5' nuclease. 

Figure 46 shows an autoradiograph of a gel resolving the products of cleavage 
20 reactions run on a double- stranded DNA substrate to demonstrate multiplexing of the 

cleavage reaction. 

Figure 47 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run on double-stranded DNA substrates consisting of the 419 and 422 mutant 
alleles derived from exon 4 of the human tyrosinase gene in the presence of various 

25 concentrations of MnCl 2 . 

Figure 48 displays two traces representing two channel signals (JOE and FAM 
fluorescent dyes) for cleavage fragments derived from a cleavage reaction containing 
two differently labelled substrates (the wild-type and 422 mutant substrates derived 
from exon 4 of the tyrosinase gene). The thin lines represent the JOE-labelled wild- 
30 type substrate and the thick lines represent the FAM-labelled 422 mutant substrate. 
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Above the tracing is an autoradiograph of a gel resolving the products of cleavage 
reactions run on double- stranded DNA substrates consisting of the wild-type and 422 
mutant alleles derived from exon 4 of the tyrosinase gene. 

Figure 4fcte?icts the nucleotide sequence of six SIV LTR clones corresponding 

to SEQ ID NOS:76-81. 

A VV Figure 50 shows an autoradiograph of a gel resolving the products of cleavage 

reactions run on six different double- stranded SIV LTR substrates which contained a 
biotin label on the 5' end of the (-) strand. 

Figure 51 shows an autoradiograph of a gel resolving the products of cleavage 
10 reactions run on six different double- stranded SIV LTR substrates which contained a 

biotin label on the 5' end of the (+) strand. 

Figure 52 shows an autoradiograph of a gel resolving the products of single- 
stranded cleavage reactions run in various concentrations of NaCl. 

Figure 53 shows an autoradiograph of a gel resolving the products of single- 
1 5 stranded cleavage reactions run in various concentrations of (NH 4 ) 2 S0 4 . 

Figure 54 shows an autoradiograph of a gel resolving the products of single- 
stranded cleavage reactions run in increasing concentrations of KC1. 

Figure 55 shows an autoradiograph of a gel resolving the products of single- 
stranded cleavage reactions run in two concentrations of KC1 for various periods of 
20 time. 

Figure 56 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run on either the single- stranded or double-stranded form of the same 
substrate. 

Figure 57 shows an autoradiograph of a gel resolving the products of double- 
25 stranded cleavage reactions run in various concentrations of KC1. 

Figure 58 shows an autoradiograph of a gel resolving the products of double- 
stranded cleavage reactions run in various concentrations of NaCl. 

Figure 59 shows an autoradiograph of a gel resolving the products of double- 
stranded cleavage reactions run in various concentrations of (NH 4 ) 2 S0 4 . 
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Figure 60 shows an autoradiograph of a gel resolving the products of double- 
stranded cleavage reactions run for various lengths of time. 

Figure 61 shows an autoradiograph of a gel resolving the products of double- 
stranded cleavage reactions run using various amounts of Cleavase™ BN enzyme for 

5 either 5 seconds or 1 minute. 

Figure 62 shows an autoradiograph of a gel resolving the products of double- 
stranded cleavage reactions run at various temperatures. 

Figure 63 shows an autoradiograph of a gel resolving the products of double- 
stranded cleavage reactions run using various amounts of Cleavase™ BN enzyme. 
10 Figure 64A shows an autoradiograph of a gel resolving the products of single- 

stranded cleavage reactions run in buffers having various pHs. 

Figure 64B shows an autoradiograph of a gel resolving the products of single- 
stranded cleavage reactions run in buffers having a pH of either 7.5 or 7.8. 

Figure 65 A shows an autoradiograph of a gel resolving the products of double- 
15 stranded cleavage reactions run in buffers having a pH of either 8.2 or 7.2. 

Figure 65B shows an autoradiograph of a gel resolving the products of double- 
stranded cleavage reactions run in buffers having a pH of either 7.5 or 7.8. 

Figure 66 shows an autoradiograph of a gel resolving the products of single- 
stranded cleavage reactions run in the presence of various amounts of human genomic 
20 DNA. 

Figure 67 shows an autoradiograph of a gel resolving the products of single- 
stranded cleavage reactions run using the T/I DNA polymerase in two different 

concentrations of KCl. 

Figure 68 shows an autoradiograph of a gel resolving the products of single- 
25 stranded cleavage reactions run using the Tth DNA polymerase in two different 

concentrations of KCl. 

Figure 69 shows an autoradiograph of a gel resolving the products of single- 
stranded cleavage reactions run using the E. coli Exo III enzyme in two different 
concentrations of KCl. 
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Figure 70 shows an autoradiograph of a gel resolving the products of single- 
stranded cleavage reactions run on three different tyrosinase gene substrates (SEQ ID 
NOS:47, 54 and 55) using either the Tth DNA polymerase, the E. coli Exo III or 

Cleavase™ BN enzyme. 

Figure 71 is a schematic drawing depicting the location of the 5' and 3' 

cleavage sites on a cleavage structure. 

Figure 72 shows an autoradiograph of a gel resolving the products of single- 
stranded cleavage reactions run on three different tyrosinase gene substrates (SEQ ID 
NOS:47, 54 and 55) using either Cleavase™ BN enzyme or the Radl/RadlO complex. 

Figure 73 shows an autoradiograph of a gel resolving the products of double- 
stranded cleavage reactions run on a wild-type and two mutant P-globin substrates. 

Figure 74A shows an autoradiograph of a gel resolving the products of single- 
stranded cleavage reactions run on a wild-type and three mutant P-globin substrates. 

Figure 74B shows an autoradiograph of a gel resolving the products of single- 
stranded cleavage reactions run on five mutant p-globin substrates. 

Figure 75 shows an autoradiograph of a gel resolving the products of double- 
stranded cleavage reactions which varied the order of addition of the reaction 
components. 

Figure 76 depicts the organization of the human p53 gene; exons are 
represented by the solid black boxes and are labelled 1-11. Five hot spot regions are 
shown as a blow-up of the region spanning exons 5-8; the hot spot regions are labelled 
A, A', B, C, and D. 

Figure 77 provides a schematic showing the use of a first 2-step PCR technique 
for the generation DNA fragments containing p53 mutations. 

Figure 78 provides a schematic showing the use of a second 2-step PCR 
technique for the generation DNA fragments containing p53 mutations. 

Figure 79 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run on a wild-type and two mutant p53 substrates. 

Figure 80 shows an autoradiograph of a gel resolving the products of cleavage 
reactions run on a wild-type and three mutant p53 substrates. 

- 26 - 



a1 



,1*5 



10 



15 



20 



25 



Figure 81 shows an autoradiography of a gel resolving the products of cleavage 
reactions run on a wild-type and a mutant p53 substrate where the mutant and wild- 
type substrates are present in various concentrations relative to one another. 

Figure 82 ft provides an alignment of HCV clones 1.1 (SEQ ID NO: 121), 
HCV2.1 (SEQ ID NO:122), HCV3.1 (SEQ ID NO:123), HCV4.2 (SEQ ID NO:124), 
HCV6.1 (SEQ ID NO:125) and HCV7.1 (SEQ ID NO:126). 

Figure 83 shows a fluoroimager scan of a gel resolving the products of 
cleavage reactions run on six double-stranded HCV substrates labeled on either the 

sense or anti- sense strand. 

Figure 84 shows an autoradiogram of a gel resolving the products of cleavage 
reactions run on a wild-type and two mutant M. tuberculosis rpoB substrates. 

Figure 85A shows a fluoroimager scan of a gel resolving the products of 
cleavage reactions run on a wild-type and two mutant M. tuberculosis rpoB substrates 
prepared using either dTTP or dUTP. 

Figure 85B shows a fluoroimager scan of the gel shown in Figure 85A 
following a longer period of electrophoresis. 

Figure 86 shows an autoradiogram of a gel resolving the products of cleavage 
reactions run on a wild-type and three mutant M. tuberculosis katG substrates labeled 

on the sense strand. 

Figure 87 shows a fluoroimager scan of a gel resolving the products of 
cleavage reactions run on a wild-type and three mutant M. tuberculosis katG substrates 

labeled on the anti-sense strand. 

Figure 88*shows the location of primers along the sequence of the E. coli rrsE 
K 

gene (SEQ ID NO: 158). 

Figure 89 p^vides an alignment of the E. coli rrsE (SEQ ID NO:158), 
Cam.jejuniS (SEQ ID NO:159), and Stp.aureus (SEQ ID NO:160) rRNA genes with 
the location of consensus PCR rRNA primers indicated in bold type. 

Figure 90 shows a fluoroimager scan of a gel resolving the products of 
cleavage reactions run on four bacterial 16S rRNA substrates. 
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Figure 91 A shows a fluoroimager scan of a gel resolving the products of 
cleavage reactions run on five bacterial 16S rRNA substrates. 

Figure 91B shows bacterial a fluoroimager scan of a gel resolving the products 
of cleavage reactions run on five bacterial 16S rRNA substrates. 
5 Figure 92 shows bacterial a fluoroimager scan of a gel resolving the products 

of cleavage reactions run on various bacterial 16S rRNA substrates. 

Figure 93 shows bacterial a fluoroimager scan of a gel resolving the products 
of cleavage reactions run on eight bacterial 16S rRNA substrates. 

Figure 94 shows an autoradiogram of a gel resolving the products of cleavage 
10 reactions run on a wild-type and mutant tyrosinase gene substrates prepared using 

naturally occurring deoxynucleotides or deoxynucleotide analogs. 



DEFINITIONS 

To facilitate understanding of the invention, a number of terms are defined 

below. 

15 The term "gene" refers to a DNA sequence that comprises control and coding 

sequences necessary for the production of a polypeptide or precursor. The polypeptide 
can be encoded by a full length coding sequence or by any portion of the coding 
sequence so long as the desired enzymatic activity is retained. 

The term "wild-type" refers to a gene or gene product which has the 

20 characteristics of that gene or gene product when isolated from a naturally occurring 

source. A wild-type gene is that which is most frequently observed in a population 
and is thus arbitrarily designed the "normal" or "wild-type" form of the gene. In 
contrast, the term "modified" or "mutant" refers to a gene or gene product which 
displays modifications in sequence and or functional properties (i.e., altered 
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characteristics) when compared to the wild-type gene or gene product. It is noted that 
naturally-occurring mutants can be isolated; these are identified by the fact that they 
have altered characteristics when compared to the wild-type gene or gene product. 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
containing a desired coding sequence and appropriate DNA sequences necessary for 
the expression of the operably linked coding sequence in a particular host organism. 
DNA sequences necessary for expression in procaryotes include a promoter, optionally 
an operator sequence, a ribosome binding site and possibly other sequences. 
Eukaryotic cells are known to utilize promoters, polyadenlyation signals and enhancers. 

The term "LTR" as used herein refers to the long terminal repeat found at each 
end of a provirus (i.e., the integrated form of a retrovirus). The LTR contains 
numerous regulatory signals including transcriptional control elements, polyadenylation 
signals and sequences needed for replication and integration of the viral genome. The 
viral LTR is divided into three regions called U3, R and U5. 

The U3 region contains the enhancer and promoter elements. The U5 region 
contains the polyadenylation signals. The R (repeat) region separates the U3 and U5 
regions and transcribed sequences of the R region appear at both the 5' and 3' ends of 
the viral RNA. 

The term "oligonucleotide" as used herein is defined as a molecule comprised 
of two or more deoxyribonucleotides or ribonucleotides, preferably more than three, 
and usually more than ten. The exact size will depend on many factors, which in turn 
depends on the ultimate function or use of the oligonucleotide. The oligonucleotide 
may be generated in any manner, including chemical synthesis, DNA replication, 
reverse transcription, or a combination thereof. 

Because mononucleotides are reacted to make oligonucleotides in a manner 
such that the 5' phosphate of one mononucleotide pentose ring is attached to the 3' 
oxygen of its neighbor in one direction via a phosphodiester linkage, an end of an 
oligonucleotide is referred to as the "5' end" if its 5' phosphate is not linked to the 3' 
oxygen of a mononucleotide pentose ring and as the "3' end" if its 3' oxygen is not 
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linked to a 5' phosphate of a subsequent mononucleotide pentose ring. As used 
herein, a nucleic acid sequence, even if internal to a larger oligonucleotide, also may 

be said to have 5' and 3' ends. 

When two different, non-overlapping oligonucleotides anneal to different 
5 regions of the same linear complementary nucleic acid sequence, and the 3' end of one 

oligonucleotide points towards the 5' end of the other, the former may be called the 
"upstream" oligonucleotide and the latter the "downstream" oligonucleotide. 

The term "primer" refers to an oligonucleotide which is capable of acting as a 
point of initiation of synthesis when placed under conditions in which primer extension 
10 is initiated. An oligonucleotide "primer" may occur naturally, as in a purified 

restriction digest or may be produced synthetically. 

A primer is selected to be "substantially" complementary to a strand of specific 
sequence of the template. A primer must be sufficiently complementary to hybridize 
with a template strand for primer elongation to occur. A primer sequence need not 
15 reflect the exact sequence of the template. For example, a non-complementary 

nucleotide fragment may be attached to the 5' end of the primer, with the remainder of 
the primer sequence being substantially complementary to the strand. Non- 
complementary bases or longer sequences can be interspersed into the primer, provided 
that the primer sequence has sufficient complementarity with the sequence of the 
20 template to hybridize and thereby form a template primer complex for synthesis of the 

extension product of the primer. 

"Hybridization" methods involve the annealing of a complementary sequence to 
the target nucleic acid (the sequence to be detected). The ability of two polymers of 
nucleic acid containing complementary sequences to find each other and anneal 
25 through base pairing interaction is a well-recognized phenomenon. The initial 

observations of the "hybridization" process by Marmur and Lane, Proc. Natl. Acad. 
Sci. USA 46:453 (1960) and Doty et al, Proc. Natl. Acad. Sci. USA 46:461 (1960) 
have been followed by the refinement of this process into an essential tool of modern 
biology. Nonetheless, a number of problems have prevented the wide scale use of 
30 hybridization as a tool in human diagnostics. Among the more formidable problems 
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are: 1) the inefficiency of hybridization; 2) the low concentration of specific target 
sequences in a mixture of genomic DNA; and 3) the hybridization of only partially 
complementary probes and targets. 

With regard to efficiency, it is experimentally observed that only a fraction of 
5 the possible number of probe-target complexes are formed in a hybridization reaction. 

This is particularly true with short oligonucleotide probes (less than 100 bases in 
length). There are three fundamental causes: a) hybridization cannot occur because of 
secondary and tertiary structure interactions; b) strands of DNA containing the target 
sequence have rehybridized (reannealed) to their complementary strand; and c) some 
10 target molecules are prevented from hybridization when they are used in hybridization 

formats that immobilize the target nucleic acids to a solid surface. 

Even where the sequence of a probe is completely complementary to the 
sequence of the target, /.*, the target's primary structure, the target sequence must be 
made accessible to the probe via rearrangements of higher-order structure. These 
15 higher-order structural rearrangements may concern either the secondary structure or 

tertiary structure of the molecule. Secondary structure is determined by intramolecular 
bonding. In the case of DNA or RNA targets this consists of hybridization within a 
single, continuous strand of bases (as opposed to hybridization between two different 
strands). Depending on the extent and position of intramolecular bonding, the probe 
20 can be displaced from the target sequence preventing hybridization. 

Solution hybridization of oligonucleotide probes to denatured double- stranded 
DNA is further complicated by the fact that the longer complementary target strands 
can renature or reanneal. Again, hybridized probe is displaced by this process. This 
results in a low yield of hybridization (low "coverage") relative to the starting 
25 concentrations of probe and target. 

With regard to low target sequence concentration, the DNA fragment 
containing the target sequence is usually in relatively low abundance in genomic DNA. 
This presents great technical difficulties; most conventional methods that use 
oligonucleotide probes lack the sensitivity necessary to detect hybridization at such low 
30 levels. 
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One attempt at a solution to the target sequence concentration problem is the 
amplification of the detection signal. Most often this entails placing one or more 
labels on an oligonucleotide probe. In the case of non-radioactive labels, even the 
highest affinity reagents have been found to be unsuitable for the detection of single 
5 copy genes in genomic DNA with oligonucleotide probes. See Wallace et al, 

Biochimie 67:755 (1985). In the case of radioactive oligonucleotide probes, only 
extremely high specific activities are found to show satisfactory results. See Studencki 
and Wallace, DNA 3:1 (1984) and Studencki et al., Human Genetics 37:42 (1985). 
With regard to complementarity, it is important for some diagnostic 
10 applications to determine whether the hybridization represents complete or partial 

complementarity. For example, where it is desired to detect simply the presence or 
absence of pathogen DNA (such as from a virus, bacterium, fungi, mycoplasma, 
protozoan) it is only important that the hybridization method ensures hybridization 
when the relevant sequence is present; conditions can be selected where both partially 
15 complementary probes and completely complementary probes will hybridize. Other 

diagnostic applications, however, may require that the hybridization method distinguish 
between partial and complete complementarity. It may be of interest to detect genetic 
polymorphisms. For example, human hemoglobin is composed, in part, of four 
polypeptide chains. Two of these chains are identical chains of 141 amino acids 
20 (alpha chains) and two of these chains are identical chains of 146 amino acids (beta 

chains). The gene encoding the beta chain is known to exhibit polymorphism. The 
normal allele encodes a beta chain having glutamic acid at the sixth position. The 
mutant allele encodes a beta chain having valine at the sixth position. This difference 
in amino acids has a profound (most profound when the individual is homozygous for 
25 the mutant allele) physiological impact known clinically as sickle cell anemia. It is 

well known that the genetic basis of the amino acid change involves a single base 
difference between the normal allele DNA sequence and the mutant allele DNA 
sequence. 
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Unless combined with other techniques (such as restriction enzyme analysis), 
methods that allow for the same level of hybridization in the case of both partial as 
well as complete complementarity are typically unsuited for such applications; the 
probe will hybridize to both the normal and variant target sequence. Hybridization, 
5 regardless of the method used, requires some degree of complementarity between the 

sequence being assayed (the target sequence) and the fragment of DNA used to 
perform the test (the probe). (Of course, one can obtain binding without any 
complementarity but this binding is nonspecific and to be avoided.) 

The complement of a nucleic acid sequence as used herein refers to an 
10 oligonucleotide which, when aligned with the nucleic acid sequence such that the 5' 

end of one sequence is paired with the 3' end of the other, is in "antiparallel 
association." Certain bases not commonly found in natural nucleic acids may be 
included in the nucleic acids of the present invention and include, for example, inosine 
and 7-deazaguanine. Complementarity need not be perfect; stable duplexes may 
15 contain mismatched base pairs or unmatched bases. Those skilled in the art of nucleic 

acid technology can determine duplex stability empirically considering a number of 
variables including, for example, the length of the oligonucleotide, base composition 
and sequence of the oligonucleotide, ionic strength and incidence of mismatched base 
pairs. 

20 Stability of a nucleic acid duplex is measured by the melting temperature, or 

"T m ." The T m of a particular nucleic acid duplex under specified conditions is the 
temperature at which on average half of the base pairs have disassociated. 

The term "probe" as used herein refers to a labeled oligonucleotide which 
forms a duplex structure with a sequence in another nucleic acid, due to 

25 complementarity of at least one sequence in the probe with a sequence in the other 

nucleic acid. 

The term "label" as used herein refers to any atom or molecule which can be 
used to provide a detectable (preferably quantifiable) signal, and which can be attached 
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to a nucleic acid or protein. Labels may provide signals detectable by fluorescence, 
radioactivity, colorimetry, gravimetry, X-ray diffraction or absorption, magnetism, 
enzymatic activity, and the like. 

The term "cleavage structure" as used herein, refers to a region of a single- 
5 stranded nucleic acid substrate containing secondary structure, said region being 

cleavable by a cleavage means, including but not limited to an enzyme. The cleavage 
structure is a substrate for specific cleavage by said cleavage means in contrast to a 
nucleic acid molecule which is a substrate for non-specific cleavage by agents such as 
phosphodiesterases which cleave nucleic acid molecules without regard to secondary 
10 structure {i.e., no folding of the substrate is required). 

The term "cleavage means" as used herein refers to any means which is capable 
of cleaving a cleavage structure, including but not limited to enzymes. The cleavage 
means may include native DNAPs having 5' nuclease activity {e.g., Taq DNA 
polymerase, E. coli DNA polymerase I) and, more specifically, modified DNAPs 
15 having 5' nuclease but lacking synthetic activity. The ability of 5' nucleases to cleave 

naturally occurring structures in nucleic acid templates (structure-specific cleavage) is 
useful to detect internal sequence differences in nucleic acids without prior knowledge 
of the specific sequence of the nucleic acid. In this manner, they are structure-specific 
enzymes. Structure-specific enzymes are enzymes which recognize specific secondary 
20 structures in a nucleic molecule and cleave these structures. The site of cleavage may 

be on either the 5' or 3' side of the cleavage structure; alternatively the site of 
cleavage may be between the 5' and 3' side {i.e., within or internal to) of the cleavage 
structure. The cleavage means of the invention cleave a nucleic acid molecule in 
response to the formation of cleavage structures; it is not necessary that the cleavage 
25 means cleave the cleavage structure at any particular location within the cleavage 

structure. 

The cleavage means is not restricted to enzymes having 5' nuclease activity. 
The cleavage means may include nuclease activity provided from a variety of sources 
including the Cleavase™ enzyme, Taq DNA polymerase, E. coli DNA polymerase I 



- 34 - 



25 



• 



and eukaryotic structure-specific endonucleases, murine FEN-1 endonucleases 
[Harrington and Liener, (1994) Genes and Develop. 8:1344] and calf thymus 5' to 3' 
exonuclease [Murante, R.S., et al. (1994) J. Biol. Chen, 269:1191]). In addition, 
enzymes having 3' nuclease activity such as members of the family of DNA repair 
5 endonucleases (e.g. , the RrpI enzyme from Drosophila melanogaster, the yeast 

RAD1/RAD10 complex and E. coli Exo III), are also suitable cleavage means for the 
practice of the methods of the invention. 

The term "cleavage products" as used herein, refers to products generated by 
the reaction of a cleavage means with a cleavage structure {i.e., the treatment of a 
10 cleavage structure with a cleavage means). 

The terms "nucleic acid substrate" and nucleic acid template" are used herein 
interchangeably and refer to a nucleic acid molecule which when denatured and 
allowed to renature {i.e., to fold upon itself by the formation of intra-strand hydrogen 
bonds), forms at least one cleavage structure. The nucleic acid substrate may comprise 
15 single- or double-stranded DNA or RNA. 

The term "substantially single- stranded" when used in reference to a nucleic 
acid substrate means that the substrate molecule exists primarily as a single strand of 
nucleic acid in contrast to a double-stranded substrate which exists as two strands of 
nucleic acid which are held together by inter-strand base pairing interactions. 

Nucleic acids form secondary structures which depend on base-pairing for 
stability. When single strands of nucleic acids (single-stranded DNA, denatured 
double-stranded DNA or RNA) with different sequences, even closely related ones, are 
allowed to fold on themselves, they assume characteristic secondary structures. At 
"elevated temperatures" the duplex regions of the structures are brought to the brink of 
instability, so that the effects of small changes in sequence are maximized, and 
revealed as alterations in the cleavage pattern. In other words, "an elevated 
temperature" is a temperature at which a given duplex region of the folded substrate 
molecule is near the temperature at which that duplex melts. An alteration in the 
sequence of the substrate will then be likely to cause the destruction of a duplex 
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re gl on(s) thereby generating a different cleavage pattern when a cleavage agent which 
is dependent upon the recognition of structure is utilized in the reaction. While not 
being limited to any particular theory, it is thought that individual molecules in the 
target (i.e., the substrate) population may each assume only one or a few of the 
5 potential cleavage structures (i.e., duplexed regions), but when the sample is analyzed 

as a whole, a composite pattern representing all cleavage sites is detected. Many of 
the structures recognized as active cleavage sites are likely to be only a few base-pairs 
long and would appear to be unstable when elevated temperatures used in the cleavage 
reaction. Nevertheless, transient formation of these structures allows recognition and 
10 cleavage of these structures by said cleavage means. The formation or disruption of 

these structures in response to small sequence changes results in changes in the 
patterns of cleavage. Temperatures in the range of 40-85°C, with the range of 55- 
85°C being particularly preferred, are suitable elevated temperatures for the practice of 

the method of the invention. 
15 The term "sequence variation" as used herein refers to differences in nucleic 

acid sequence between two nucleic acid templates. For example, a wild-type structural 
gene and a mutant form of this wild-type structural gene may vary in sequence by the 
presence of single base substitutions and/or deletions or insertions of one or more 
nucleotides. These two forms of the structural gene are said to vary in sequence from 
20 one another. A second mutant form of the structural gene may exits. This second 

mutant form is said to vary in sequence from both the wild-type gene and the first 
mutant form of the gene. It is noted, however, that the invention does not require that 
a comparison be made between one or more forms of a gene to detect sequence 
variations. Because the method of the invention generates a characteristic and 
25 reproducible pattern of cleavage products for a given nucleic acid substrate, a 

characteristic "fingerprint" may be obtained from any nucleic substrate without 
reference to a wild-type or other control. The invention contemplates the use of the 
method for both "fingerprinting" nucleic acids without reference to a control and 
identification of mutant forms of a substrate nucleic acid by comparison of the mutant 
30 form of the substrate with a wild-type or known mutant control. 
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The term "liberating" as used herein refers to the release of a nucleic acid 
fragment from a larger nucleic acid fragment, such as an oligonucleotide, by the action 
of a 5' nuclease such that the released fragment is no longer covalently attached to the 
remainder of the oligonucleotide. 
5 The term "substrate strand" as used herein, means that strand of nucleic acid in 

a cleavage structure in which the cleavage mediated by the 5' nuclease activity occurs. 

The term "template strand" as used herein, means that strand of nucleic acid in 
a cleavage structure which is at least partially complementary to the substrate strand 
and which anneals to the substrate strand to form the cleavage structure. 
10 The term "K^" as used herein refers to the Michaelis-Menten constant for an 

enzyme and is defined as the concentration of the specific substrate at which a given 
enzyme yields one-half its maximum velocity in an enzyme catalyzed reaction. 

The term "nucleotide analog" as used herein refers to modified or non-naturally 
occurring nucleotides such as 7-deaza purines {i.e., 7-deaza-dATP and 7-deaza-dGTP). 
15 Nucleotide analogs include base analogs and comprise modified forms of 

deoxyribonucleotides as well as ribonucleotides. As used herein the term "nucleotide 
analog" when used in reference to substrates present in a PCR mixture refers to the use 
of nucleotides other than dATP, dGTP, dCTP and dTTP; thus, the use of dUTP (a 
naturally occurring dNTP) in a PCR would comprise the use of a nucleotide analog in 
20 the PCR. A PCR product generated using dUTP, 7-deaza-dATP, 7-deaza-dGTP or any 

other nucleotide analog in the reaction mixture is said to contain nucleotide analogs. 

"Oligonucleotide primers matching or complementary to a gene sequence" 
refers to oligonucleotide primers capable of facilitating the template-dependent 
synthesis of single or double-stranded nucleic acids. Oligonucleotide primers matching 
25 or complementary to a gene sequence may be used in PCRs, RT-PCRs and the like. 

A "consensus gene sequence" refers to a gene sequence which is derived by 
comparison of two or more gene sequences and which describes the nucleotides most 
often present in a given segment of the genes; the consensus sequence is the canonical 
sequence. 
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The term "polymorphic locus" is a locus present in a population which shows 
variation between members of the population {i.e., the most common allele has a 
frequency of less than 0.95). In contrast, a "monomorphic locus" is a genetic locus at 
little or no variations seen between members of the population (generally taken to be a 
locus at which the most common allele exceeds a frequency of 0.95 in the gene pool 
of the population). 

The term "microorganism" as used herein means an organism too small to be 
observed with the unaided eye and includes, but is not limited to bacteria, virus, 

protozoans, fungi, and ciliates. 

The term "microbial gene sequences" refers to gene sequences derived from a 

microorganism. 

The term "bacteria" refers to any bacterial species including eubacterial and 

archaebacterial species. 

The term "virus" refers to obligate, ultramicroscopic, intracellular parasites 
incapable of autonomous replication {i.e., replication requires the use of the host cell's 
machinery). 

The term "multi-drug resistant" or multiple-drug resistant" refers to a 
microorganism which is resistant to more than one of the antibiotics or antimicrobial 
agents used in the treatment of said microorganism. 



20 DESCRIPTION OF THE INVENTION 

The present invention relates to methods and compositions for treating nucleic 
acid, and in particular, methods and compositions for detection and characterization of 
nucleic acid sequences and sequence changes. 

The present invention relates to means for cleaving a nucleic acid cleavage 
25 structure in a site-specific manner. In particular, the present invention relates to a 

cleaving enzyme having 5' nuclease activity without interfering nucleic acid synthetic 
ability. 

This invention provides 5' nucleases derived from thermostable DNA 
polymerases which exhibit altered DNA synthetic activity from that of native 
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thermostable DNA polymerases. The 5' nuclease activity of the polymerase is retained 
while the synthetic activity is reduced or absent. Such 5' nucleases are capable of 
catalyzing the structure- specific cleavage of nucleic acids in the absence of interfering 
synthetic activity. The lack of synthetic activity during a cleavage reaction results m 
nucleic acid cleavage products of uniform size. 

The novel properties of the polymerases of the invention form the basis of a 
method of detecting specific nucleic acid sequences. This method relies upon the 
amplification of the detection molecule rather than upon the amplification of the target 
sequence itself as do existing methods of detecting specific target sequences. 

DNA polymerases (DNAPs), such as those isolated from E. coli or from 
thermophilic bacteria of the genus Thermus, are enzymes that synthesize new DNA 
strands. Several of the known DNAPs contain associated nuclease activities in 
addition to the synthetic activity of the enzyme. 

Some DNAPs are known to remove nucleotides from the 5' and 3' ends of 
DNA chains [Kornberg, DNA Replication, W.H. Freeman and Co., San Francisco, 
pp. 127-139 (1980)]. These nuclease activities are usually referred to as 5' 
exonuclease and 3' exonuclease activities, respectively. For example, the 5' 
exonuclease activity located in the N-terminal domain of several DNAPs participates in 
the removal of RNA primers during lagging strand synthesis during DNA replication 
and the removal of damaged nucleotides during repair. Some DNAPs, such as the E. 
coli DNA polymerase (DNAPEcl), also have a 3' exonuclease activity responsible for 
proof-reading during DNA synthesis (Kornberg, supra). 

A DNAP isolated from Thermus aquaticus, termed Tag DNA polymerase 
(DNAP Tag), has a 5' exonuclease activity, but lacks a functional 3' exonucleolytic 
domain [Tindall and Kunkell, Biochem. 27:6008 (1988)]. Derivatives of DNAPEcl 
and DNAP Tag, respectively called the Klenow and Stoffel fragments, lack 5' 
exonuclease domains as a result of enzymatic or genetic manipulations [Brutlag et al, 
Biochem. Biophys. Res. Commun. 37:982 (1969); Erlich et al, Science 252:1643 
(1991); Setlow and Kornberg, J. Biol. Chem. 247:232 (1972)]. 
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The 5' exonuclease activity of DNAPZag was reported to require concurrent 
synthesis [Gelfand, PCR Technology - Principles and Applications for DNA 
Amplification (H.A. Erhch, Ed.), Stockton Press, New York, p. 19 (1989)]. Although 
mononucleotides predominate among the digestion products of the 5' exonucleases of 
DNAPTfcf and DNAPEcl, short oligonucleotides (< 12 nucleotides) can also be 
observed implying that these so-called 5' exonucleases can function 
endonucleolytically [Setlow, supra; Holland et al., Proc. Natl. Acad. Sci. USA 88:7276 
(1991)]. 

In WO 92/06200, Gelfand et al. show that the preferred substrate of the 5' 
exonuclease activity of the thermostable DNA polymerases is displaced single-stranded 
DNA. Hydrolysis of the phosphodiester bond occurs between the displaced single- 
stranded DNA and the double-helical DNA with the preferred exonuclease cleavage 
site being a phosphodiester bond in the double helical region. Thus, the 5' 
exonuclease activity usually associated with DNAPs is a structure-dependent single- 
stranded endonuclease and is more properly referred to as a 5' nuclease. Exonucleases 
are enzymes which cleave nucleotide molecules from the ends of the nucleic acid 
molecule. Endonucleases, on the other hand, are enzymes which cleave the nucleic 
acid molecule at internal rather than terminal sites. The nuclease activity associated 
with some thermostable DNA polymerases cleaves endonucleolytically but this 
cleavage requires contact with the 5' end of the molecule being cleaved. Therefore, 
these nucleases are referred to as 5' nucleases. 

When a 5' nuclease activity is associated with a eubacterial Type A DNA 
polymerase, it is found in the one-third N-terminal region of the protein as an 
independent functional domain. The C-terminal two-thirds of the molecule constitute 
the polymerization domain which is responsible for the synthesis of DNA. Some Type 
A DNA polymerases also have a 3' exonuclease activity associated with the two-third 
C-terminal region of the molecule. 

The 5' exonuclease activity and the polymerization activity of DNAPs have 
been separated by proteolytic cleavage or genetic manipulation of the polymerase 
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molecule. To date thermostable DNAPs have been modified to remove or reduce the 
amount of 5' nuclease activity while leaving the polymerase activity intact. 

The Klenow or large proteolytic cleavage fragment of DNAPEcl contains the 
polymerase and 3' exonuclease activity but lacks the 5' nuclease activity. The Stoffel 
5 fragment of DNAP Tag (DNAPStf) lacks the 5' nuclease activity due to a genetic 

manipulation which deleted the N-terminal 289 amino acids of the polymerase 
molecule [Erlich et al., Science 252:1643 (1991)]. WO 92/06200 describes a 
thermostable DNAP with an altered level of 5' to 3' exonuclease. U.S. Patent No. 
5,108,892 describes a Thermus aquaticus DNAP without a 5' to 3' exonuclease. 
10 However, the art of molecular biology lacks a thermostable DNA polymerase with a 

lessened amount of synthetic activity. 

The present invention provides 5' nucleases derived from thermostable Type A 
DNA polymerases that retain 5' nuclease activity but have reduced or absent synthetic 
activity. The ability to uncouple the synthetic activity of the enzyme from the 5' 
15 nuclease activity proves that the 5' nuclease activity does not require concurrent DNA 

synthesis as was previously reported (Gelfand, PCR Technology, supra). 

The description of the invention is divided into: I. Detection of Specific 
Nucleic Acid Sequences Using 5' Nucleases; II. Generation of 5' Nucleases Derived 
From Thermostable DNA Polymerases; III. Therapeutic Uses of 5' Nucleases; IV. 
20 Detection of Antigenic or Nucleic Acid Targets by a Dual Capture Assay; and V. 

Cleavase™ Fragment Length Polymorphism for the Detection of Secondary Structure 
and VI. Detection of Mutations in the p53 Tumor Suppressor Gene Using the CFLP™ 
Method. 
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I. Detection Of Specific Nucleic Acid Sequences Using 5' Nucleases 

The 5' nucleases of the invention form the basis of a novel detection assay for 
the identification of specific nucleic acid sequences. This detection system identifies 
the presence of specific nucleic acid sequences by requiring the annealing of two 
oligonucleotide probes to two portions of the target sequence. As used herein, the 
term "target sequence" or "target nucleic acid sequence" refers to a specific nucleic 
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acid sequence within a polynucleotide sequence, such as genomic DNA or RNA, 
which is to be either detected or cleaved or both. 

Figure 1 A provides a schematic of one embodiment of the detection method of 
the present invention. The target sequence is recognized by two distinct 
5 oligonucleotides in the triggering or trigger reaction. It is preferred that one of these 

oligonucleotides is provided on a solid support. The other can be provided free. In 
Figure 1A the free oligo is indicated as a "primer" and the other oligo is shown 
attached to a bead designated as type 1. The target nucleic acid aligns the two 
oligonucleotides for specific cleavage of the 5' arm (of the oligo on bead 1) by the 
10 DNAPs of the present invention (not shown in Figure 1 A). 

The site of cleavage (indicated by a large solid arrowhead) is controlled by the 
distance between the 3' end of the "primer" and the downstream fork of the oligo on 
bead 1. The latter is designed with an uncleavable region (indicated by the striping). 
In this manner neither oligonucleotide is subject to cleavage when misaligned or when 

15 unattached to target nucleic acid. 

Successful cleavage releases a single copy of what is referred to as the alpha 
signal oligo. This oligo may contain a detectable moiety {e.g., fluorescein). On the 
other hand, it may be unlabelled. 

In one embodiment of the detection method, two more oligonucleotides are 
20 provided on solid supports. The oligonucleotide shown in Figure 1A on bead 2 has a 

region that is complementary to the alpha signal oligo (indicated as alpha prime) 
allowing for hybridization. This structure can be cleaved by the DNAPs of the present 
invention to release the beta signal oligo. The beta signal oligo can then hybridize to 
type 3 beads having an oligo with a complementary region (indicated as beta prime). 
25 Again, this structure can be cleaved by the DNAPs of the present invention to release 

a new alpha oligo. 

At this point, the amplification has been linear. To increase the power of the 
method, it is desired that the alpha signal oligo hybridized to bead type 2 be liberated 
after release of the beta oligo so that it may go on to hybridize with other oligos on 
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type 2 beads. Similarly, after release of an alpha oligo from type 3 beads, it is desired 

that the beta oligo be liberated. 

The liberation of "captured" signal oligos can be achieved in a number of ways. 
First, it has been found that the DNAPs of the present invention have a true 5' 
exonuclease capable of "nibbling" the 5' end of the alpha (and beta) prime oligo 
(discussed below in more detail). Thus, under appropriate conditions, the 
hybridization is destabilized by nibbling of the DNAP. Second, the alpha - alpha 
prime (as well as the beta - beta prime) complex can be destabilized by heat {e.g., 
thermal cycling). 

With the liberation of signal oligos by such techniques, each cleavage results in 
a doubling of the number of signal oligos. In this manner, detectable signal can 

quickly be achieved. 

Figure IB provides a schematic of a second embodiment of the detection 
method of the present invention. Again, the target sequence is recognized by two 
distinct oligonucleotides in the triggering or trigger reaction and the target nucleic acid 
aligns the two oligonucleotides for specific cleavage of the 5' arm by the DNAPs of 
the present invention (not shown in Figure IB). The first oligo is completely 
complementary to a portion of the target sequence. The second oligonucleotide is 
partially complementary to the target sequence; the 3' end of the second 
oligonucleotide is fully complementary to the target sequence while the 5' end is non- 
complementary and forms a single-stranded arm. The non-complementary end of the 
second oligonucleotide may be a generic sequence which can be used with a set of 
standard hairpin structures (described below). The detection of different target 
sequences would require unique portions of two oligonucleotides: the entire first 
oligonucleotide and the 3' end of the second oligonucleotide. The 5' arm of the 
second oligonucleotide can be invariant or generic in sequence. 

The annealing of the first and second oligonucleotides near one another along 
the target sequence forms a forked cleavage structure which is a substrate for the 5' 
nuclease of DNA polymerases. The approximate location of the cleavage site is again 
indicated by the large solid arrowhead in Figure IB. 
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The 5' nucleases of the invention are capable of cleaving this structure but are 
not capable of polymerizing the extension of the 3' end of the first oligonucleotide. 
The lack of polymerization activity is advantageous as extension of the first 
oligonucleotide results in displacement of the annealed region of the second 
5 oligonucleotide and results in moving the site of cleavage along the second 

oligonucleotide. If polymerization is allowed to occur to any significant amount, 
multiple lengths of cleavage product will be generated. A single cleavage product of 
uniform length is desirable as this cleavage product initiates the detection reaction. 

The trigger reaction may be run under conditions that allow for thermocycling. 

10 Thermocycling of the reaction allows for a logarithmic increase in the amount of the 

trigger oligonucleotide released in the reaction. 

The second part of the detection method allows the annealing of the fragment 
of the second oligonucleotide liberated by the cleavage of the first cleavage structure 
formed in the triggering reaction (called the third or trigger oligonucleotide) to a first 

15 hairpin structure. This first hairpin structure has a single-stranded 5' arm and a single- 

stranded 3' arm. The third oligonucleotide triggers the cleavage of this first hairpin 
structure by annealing to the 3' arm of the hairpin thereby forming a substrate for 
cleavage by the 5 ' nuclease of the present invention. The cleavage of this first hairpin 
structure generates two reaction products: 1) the cleaved 5' arm of the hairpin called 

20 the fourth oligonucleotide, and 2) the cleaved hairpin structure which now lacks the 5' 

arm and is smaller in size than the uncleaved hairpin. This cleaved first hairpin may 
be used as a detection molecule to indicate that cleavage directed by the trigger or 
third oligonucleotide occurred. Thus, this indicates that the first two oligonucleotides 
found and annealed to the target sequence thereby indicating the presence of the target 

25 sequence in the sample. 

The detection products are amplified by having the fourth oligonucleotide 
anneal to a second hairpin structure. This hairpin structure has a 5' single-stranded 
arm and a 3' single-stranded arm. The fourth oligonucleotide generated by cleavage of 
the first hairpin structure anneals to the 3' arm of the second hairpin structure thereby 

30 creating a third cleavage structure recognized by the 5' nuclease. The cleavage of this 
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second hairpin structure also generates two reaction products: 1) the cleaved 5' arm of 
the hairpin called the fifth oligonucleotide which is similar or identical in sequence to 
the third nucleotide, and 2) the cleaved second hairpin structure which now lacks the 
5' arm and is smaller in size than the uncleaved hairpin. This cleaved second hairpin 
may be as a detection molecule and amplifies the signal generated by the cleavage of 
the first hairpin structure. Simultaneously with the annealing of the forth 
oligonucleotide, the third oligonucleotide is dissociated from the cleaved first hairpin 
molecule so that it is free to anneal to a new copy of the first hairpin structure. The 
disassociation of the oligonucleotides from the hairpin structures may be accomplished 
by heating or other means suitable to disrupt base-pairing interactions. 

Further amplification of the detection signal is achieved by annealing the fifth 
oligonucleotide (similar or identical in sequence to the third oligonucleotide) to another 
molecule of the first hairpin structure. Cleavage is then performed and the 
oligonucleotide that is liberated then is annealed to another molecule of the second 
hairpin structure. Successive rounds of annealing and cleavage of the first and second 
hairpin structures, provided in excess, are performed to generate a sufficient amount of 
cleaved hairpin products to be detected. The temperature of the detection reaction is 
cycled just below and just above the annealing temperature for the oligonucleotides 
used to direct cleavage of the hairpin structures, generally about 55°C to 70°C. The 
number of cleavages will double in each cycle until the amount of hairpin structures 
remaining is below the for the hairpin structures. This point is reached when the 
hairpin structures are substantially used up. When the detection reaction is to be used 
in a quantitative manner, the cycling reactions are stopped before the accumulation of 
the cleaved hairpin detection products reach a plateau. 

Detection of the cleaved hairpin structures may be achieved in several ways. In 
one embodiment detection is achieved by separation on agarose or polyacrylamide gels 
followed by staining with ethidium bromide. In another embodiment, detection is 
achieved by separation of the cleaved and uncleaved hairpin structures on a gel 
followed by autoradiography when the hairpin structures are first labelled with a 
radioactive probe and separation on chromatography columns using HPLC or FPLC 
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followed by detection of the differently sized fragments by absorption at OD 260 . 
Other means of detection include detection of changes in fluorescence polarization 
when the single-stranded 5' arm is released by cleavage, the increase in fluorescence 
of an intercalating fluorescent indicator as the amount of primers annealed to 3' arms 
5 of the hairpin structures increases. The formation of increasing amounts of duplex 

DNA (between the primer and the 3' arm of the hairpin) occurs if successive rounds 
of cleavage occur. 

The hairpin structures may be attached to a solid support, such as an agarose, 
styrene or magnetic bead, via the 3' end of the hairpin. A spacer molecule may be 

10 placed between the 3' end of the hairpin and the bead, if so desired. The advantage of 

attaching the hairpin structures to a solid support is that this prevents the hybridization 
of the two hairpin structures to one another over regions which are complementary. If 
the hairpin structures anneal to one another, this would reduce the amount of hairpins 
available for hybridization to the primers released during the cleavage reactions. If the 

1 5 hairpin structures are attached to a solid support, then additional methods of detection 

of the products of the cleavage reaction may be employed. These methods include, 
but are not limited to, the measurement of the released single-stranded 5' arm when 
the 5' arm contains a label at the 5' terminus. This label may be radioactive, 
fluorescent, biotinylated, etc. If the hairpin structure is not cleaved, the 5' label will 

20 remain attached to the solid support. If cleavage occurs, the 5' label will be released 

from the solid support. 

The 3' end of the hairpin molecule may be blocked through the use of 
dideoxynucleotides. A 3' terminus containing a dideoxynucleotide is unavailable to 
participate in reactions with certain DNA modifying enzymes, such as terminal 

25 transferase. Cleavage of the hairpin having a 3' terminal dideoxynucleotide generates 

a new, unblocked 3' terminus at the site of cleavage. This new 3' end has a free 
hydroxyl group which can interact with terminal transferase thus providing another 
means of detecting the cleavage products. 

The hairpin structures are designed so that their self-complementary regions are 

30 very short (generally in the range of 3-8 base pairs). Thus, the hairpin structures are 
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not stable at the high temperatures at which this reaction is performed (generally in the 
range of 50-75°C) unless the hairpin is stabilized by the presence of the annealed 
oligonucleotide on the 3 ' arm of the hairpin. This instability prevents the polymerase 
from cleaving the hairpin structure in the absence of an associated primer thereby 
5 preventing false positive results due to non-oligonucleotide directed cleavage. 

As discussed above, the use of the 5' nucleases of the invention which have 
reduced polymerization activity is advantageous in this method of detecting specific 
nucleic acid sequences. Significant amounts of polymerization during the cleavage 
reaction would cause shifting of the site of cleavage in unpredictable ways resulting in 

10 the production of a series of cleaved hairpin structures of various sizes rather than a 

single easily quantifiable product. Additionally, the primers used in one round of 
cleavage could, if elongated, become unusable for the next cycle, by either forming an 
incorrect structure or by being too long to melt off under moderate temperature cycling 
conditions. In a pristine system (i.e., lacking the presence of dNTPs), one could use 

15 the unmodified polymerase, but the presence of nucleotides (dNTPs) can decrease the 

per cycle efficiency enough to give a false negative result. When a crude extract 
(genomic DNA preparations, crude cell lysates, etc.) is employed or where a sample of 
DNA from a PCR reaction, or any other sample that might be contaminated with 
dNTPs, the 5 ' nucleases of the present invention that were derived from thermostable 

20 polymerases are particularly useful. 

II. Generation Of 5' Nucleases From Thermostable DNA Polymerases 

The genes encoding Type A DNA polymerases share about 85% homology to 
each other on the DNA sequence level. Preferred examples of thermostable 
polymerases include those isolated from Thermus aquaticus, Thermus flavus, and 
25 Thermus thermophilus . However, other thermostable Type A polymerases which have 

5' nuclease activity are also suitable. Figs. 2 and 3 compare the nucleotide and amino 
acid sequences of the three above mentioned polymerases. In Figures 2 and 3, the 
consensus or majority sequence derived from a comparison of the nucleotide (Fig. 2) 
or amino acid (Fig. 3) sequence of the three thermostable DNA polymerases is shown 
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on the top line. A dot appears in the sequences of each of these three polymerases 
whenever an amino acid residue in a given sequence is identical to that contained in 
the consensus amino acid sequence. Dashes are used to introduce gaps in order to 
maximize alignment between the displayed sequences. When no consensus nucleotide 
5 or amino acid is present at a given position, an "X" is placed in the consensus 

sequence. SEQ ID NOS:l-3 display the nucleotide sequences and SEQ ID NOS:4-6 
display the amino acid sequences of the three wild-type polymerases. SEQ ID NO:l 
corresponds to the nucleic acid sequence of the wild type Thermus aquaticus DNA 
polymerase gene isolated from the YT-1 strain [Lawyer et al, J. Biol. Chem. 264:6427 

10 (1989)]. SEQ ID NO:2 corresponds to the nucleic acid sequence of the wild type 

Thermus flavus DNA polymerase gene [Akhmetzjanov and Vakhitov, Nucl. Acids Res. 
20:5839 (1992)]. SEQ ID NO:3 corresponds to the nucleic acid sequence of the wild 
type Thermus thermophilus DNA polymerase gene [Gelfand et al, WO 91/09950 
(1991)]. SEQ ID NOS:7-8 depict the consensus nucleotide and amino acid sequences, 

15 respectively for the above three DNAPs (also shown on the top row in Figs. 2 and 3). 

The 5' nucleases of the invention derived from thermostable polymerases have 
reduced synthetic ability, but retain substantially the same 5' exonuclease activity as 
the native DNA polymerase. The term "substantially the same 5' nuclease activity" as 
used herein means that the 5' nuclease activity of the modified enzyme retains the 

20 ability to function as a structure-dependent single- stranded endonuclease but not 

necessarily at the same rate of cleavage as compared to the unmodified enzyme. Type 
A DNA polymerases may also be modified so as to produce an enzyme which has 
increases 5' nuclease activity while having a reduced level of synthetic activity. 
Modified enzymes having reduced synthetic activity and increased 5' nuclease activity 

25 are also envisioned by the present invention. 

By the term "reduced synthetic activity" as used herein it is meant that the 
modified enzyme has less than the level of synthetic activity found in the unmodified 
or "native" enzyme. The modified enzyme may have no synthetic activity remaining 
or may have that level of synthetic activity that will not interfere with the use of the 

30 modified enzyme in the detection assay described below. The 5' nucleases of the 
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present invention are advantageous in situations where the cleavage activity of the 
polymerase is desired, but the synthetic ability is not (such as in the detection assay of 
the invention). 

As noted above, it is not intended that the invention be limited by the nature of 
the alteration necessary to render the polymerase synthesis deficient. The present 
invention contemplates a variety of methods, including but not limited to: 
1) proteolysis; 2) recombinant constructs (including mutants); and 3) physical and/or 
chemical modification and/or inhibition. 

1. Proteolysis 

Thermostable DNA polymerases having a reduced level of synthetic activity are 
produced by physically cleaving the unmodified enzyme with proteolytic enzymes to 
produce fragments of the enzyme that are deficient in synthetic activity but retain 5' 
nuclease activity. Following proteolytic digestion, the resulting fragments are 
separated by standard chromatographic techniques and assayed for the ability to 
synthesize DNA and to act as a 5' nuclease. The assays to determine synthetic activity 
and 5' nuclease activity are described below. 

2. Recombinant Constructs 

The examples below describe a preferred method for creating a construct 
encoding a 5' nuclease derived from a thermostable DNA polymerase. As the Type A 
DNA polymerases are similar in DNA sequence, the cloning strategies employed for 
the Thermus aquaticus and flavus polymerases are applicable to other thermostable 
Type A polymerases. In general, a thermostable DNA polymerase is cloned by 
isolating genomic DNA using molecular biological methods from a bacteria containing 
a thermostable Type A DNA polymerase. This genomic DNA is exposed to primers 
which are capable of amplifying the polymerase gene by PCR. 

This amplified polymerase sequence is then subjected to standard deletion 
processes to delete the polymerase portion of the gene. Suitable deletion processes are 
described below in the examples. 
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The example below discusses the strategy used to determine which portions of 
the DNAP Taq polymerase domain could be removed without eliminating the 5' 
nuclease activity. Deletion of amino acids from the protein can be done either by 
deletion of the encoding genetic material, or by introduction of a translational stop 
5 codon by mutation or frame shift. In addition, proteolytic treatment of the protein 

molecule can be performed to remove segments of the protein. 

In the examples below, specific alterations of the Taq gene were: a deletion 
between nucleotides 1601 and 2502 (the end of the coding region), a 4 nucleotide 
insertion at position 2043, and deletions between nucleotides 1614 and 1848 and 

10 between nucleotides 875 and 1778 (numbering is as in SEQ ID NO:l). These 

modified sequences are described below in the examples and at SEQ ID NOS:9-12. 

Those skilled in the art understand that single base pair changes can be 
innocuous in terms of enzyme structure and function. Similarly, small additions and 
deletions can be present without substantially changing the exonuclease or polymerase 

15 function of these enzymes. 

Other deletions are also suitable to create the 5' nucleases of the present 
invention. It is preferable that the deletion decrease the polymerase activity of the 5' 
nucleases to a level at which synthetic activity will not interfere with the use of the 5' 
nuclease in the detection assay of the invention. Most preferably, the synthetic ability 

20 is absent. Modified polymerases are tested for the presence of synthetic and 5' 

nuclease activity as in assays described below. Thoughtful consideration of these 
assays allows for the screening of candidate enzymes whose structure is heretofore as 
yet unknown. In other words, construct "X" can be evaluated according to the 
protocol described below to determine whether it is a member of the genus of 5' 

25 nucleases of the present invention as defined functionally, rather than structurally. 

In the example below, the PCR product of the amplified Thermus aquaticus 
genomic DNA did not have the identical nucleotide structure of the native genomic 
DNA and did not have the same synthetic ability of the original clone. Base pair 
changes which result due to the infidelity of DNAPTaq during PCR amplification of a 

30 polymerase gene are also a method by which the synthetic ability of a polymerase 
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gene may be inactivated. The examples below and Figs. 4A and 5A indicate regions 
in the native Thermus aquaticus and Jlavus DNA polymerases likely to be important 
for synthetic ability. There are other base pair changes and substitutions that will 
likely also inactivate the polymerase. 
5 It is not necessary, however, that one start out the process of producing a 5' 

nuclease from a DNA polymerase with such a mutated amplified product. This is the 
method by which the examples below were performed to generate the synthesis- 
deficient DNAPTaq mutants, but it is understood by those skilled in the art that a 
wild-type DNA polymerase sequence may be used as the starting material for the 
10 introduction of deletions, insertion and substitutions to produce a 5' nuclease. For 

example, to generate the synthesis-deficient DNAPTfl mutant, the primers listed in 
SEQ ID NOS: 13-14 were used to amplify the wild type DNA polymerase gene from 
Thermus flavus strain AT-62. The amplified polymerase gene was then subjected to 
restriction enzyme digestion to delete a large portion of the domain encoding the 

15 synthetic activity. 

The present invention contemplates that the nucleic acid construct of the 
present invention be capable of expression in a suitable host. Those in the art know 
methods for attaching various promoters and 3' sequences to a gene structure to 
achieve efficient expression. The examples below disclose two suitable vectors and six 

20 suitable vector constructs. Of course, there are other promoter/vector combinations 

that would be suitable. It is not necessary that a host organism be used for the 
expression of the nucleic acid constructs of the invention. For example, expression of 
the protein encoded by a nucleic acid construct may be achieved through the use of a 
cell-free in vitro transcription/translation system. An example of such a cell-free 

25 system is the commercially available TnT™ Coupled Reticulocyte Lysate System 

(Promega Corporation, Madison, WI). 

Once a suitable nucleic acid construct has been made, the 5' nuclease may be 
produced from the construct. The examples below and standard molecular biological 
teachings enable one to manipulate the construct by different suitable methods. 
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Once the 5' nuclease has been expressed, the polymerase is tested for both 
synthetic and nuclease activity as described below. 

3. Physical And/Or Chemical Modification And/Or 
Inhibition 

5 The synthetic activity of a thermostable DNA polymerase may be reduced by 

chemical and/or physical means. In one embodiment, the cleavage reaction catalyzed 
by the 5' nuclease activity of the polymerase is run under conditions which 
preferentially inhibit the synthetic activity of the polymerase. The level of synthetic 
activity need only be reduced to that level of activity which does not interfere with 

10 cleavage reactions requiring no significant synthetic activity. 

As shown in the examples below, concentrations of Mg^ greater than 5 mM 
inhibit the polymerization activity of the native DNAPJag. The ability of the 5 ' 
nuclease to function under conditions where synthetic activity is inhibited is tested by 
running the assays for synthetic and 5' nuclease activity, described below, in the 

15 presence of a range of Mg^ concentrations (5 to 10 mM). The effect of a given 

concentration of Mg^ is determined by quantitation of the amount of synthesis and 
cleavage in the test reaction as compared to the standard reaction for each assay. 

The inhibitory effect of other ions, polyamines, denaturants, such as urea, 
formamide, dimethylsulfoxide, glycerol and non-ionic detergents (Triton X-100 and 

20 TWEEN-20), nucleic acid binding chemicals such as, actinomycin D, ethidium 

bromide and psoralens, are tested by their addition to the standard reaction buffers for 
the synthesis and 5' nuclease assays. Those compounds having a preferential 
inhibitory effect on the synthetic activity of a thermostable polymerase are then used to 
create reaction conditions under which 5' nuclease activity (cleavage) is retained while 

25 synthetic activity is reduced or eliminated. 

Physical means may be used to preferentially inhibit the synthetic activity of a 
polymerase. For example, the synthetic activity of thermostable polymerases is 
destroyed by exposure of the polymerase to extreme heat (typically 96 to 100°C) for 
extended periods of time (greater than or equal to 20 minutes). While these are minor 
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differences with respect to the specific heat tolerance for each of the enzymes, these 
are readily determined. Polymerases are treated with heat for various periods of time 
and the effect of the heat treatment upon the synthetic and 5' nuclease activities is 
determined. 

5 III. Therapeutic Utility Of 5' Nucleases 

The 5' nucleases of the invention have not only the diagnostic utility discussed 
above, but additionally have therapeutic utility for the cleavage and inactivation of 
specific mRNAs inside infected cells. The mRNAs of pathogenic agents, such as 
viruses, bacteria, are targeted for cleavage by a synthesis-deficient DNA polymerase by 

10 the introduction of a oligonucleotide complementary to a given mRNA produced by 

the pathogenic agent into the infected cell along with the synthesis-deficient 
polymerase. Any pathogenic agent may be targeted by this method provided the 
nucleotide sequence information is available so that an appropriate oligonucleotide may 
be synthesized. The synthetic oligonucleotide anneals to the complementary mRNA 

15 thereby forming a cleavage structure recognized by the modified enzyme. The ability 

of the 5' nuclease activity of thermostable DNA polymerases to cleave RNA-DNA 
hybrids is shown herein in Example ID. 

Liposomes provide a convenient delivery system. The synthetic oligonucleotide 
may be conjugated or bound to the nuclease to allow for co-delivery of these 

20 molecules. Additional delivery systems may be employed. 

Inactivation of pathogenic mRNAs has been described using antisense gene 
regulation and using ribozymes (Rossi, U.S. Patent No. 5,144,019, hereby incorporated 
by reference). Both of these methodologies have limitations. 

The use of antisense RNA to impair gene expression requires stoichiometric 

25 and therefore, large molar excesses of anti-sense RNA relative to the pathogenic RNA 

to be effective. Ribozyme therapy, on the other hand, is catalytic and therefore lacks 
the problem of the need for a large molar excess of the therapeutic compound found 
with antisense methods. However, ribozyme cleavage of a given RNA requires the 
presence of highly conserved sequences to form the catalytically active cleavage 
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structure. This requires that the target pathogenic mRNA contain the conserved 
sequences (GAAAC (X) n GU) thereby limiting the number of pathogenic mRNAs that 
can be cleaved by this method. In contrast, the catalytic cleavage of RNA by the use 
of a DNA oligonucleotide and a 5' nuclease is dependent upon structure only; thus, 
5 virtually any pathogenic RNA sequence can be used to design an appropriate cleavage 

structure. 

IV. Detection Of Antigenic Or Nucleic Acid Targets By A Dual 
Capture Assay 

The ability to generate 5' nucleases from thermostable DNA polymerases 

10 provides the basis for a novel means of detecting the presence of antigenic or nucleic 

acid targets. In this dual capture assay, the polymerase domains encoding the synthetic 
activity and the nuclease activity are covalently attached to two separate and distinct 
antibodies or oligonucleotides. When both the synthetic and the nuclease domains are 
present in the same reaction and dATP, dTTP and a small amount of poly d(A-T) are 

15 provided, an enormous amount of poly d(A-T) is produced. The large amounts of 

poly d(A-T) are produced as a result of the ability of the 5' nuclease to cleave newly 
made poly d(A-T) to generate primers that are, in turn, used by the synthetic domain 
to catalyze the production of even more poly d(A-T). The 5' nuclease is able to 
cleave poly d(A-T) because poly d(A-T) is self-complementary and easily forms 

20 alternate structures at elevated temperatures. These structures are recognized by the 5' 

nuclease and are then cleaved to generate more primer for the synthesis reaction. 

The following is an example of the dual capture assay to detect an antigen(s): 
A sample to be analyzed for a given antigen(s) is provided. This sample may 
comprise a mixture of cells; for example, cells infected with viruses display virally- 

25 encoded antigens on their surface. If the antigen(s) to be detected are present in 

solution, they are first attached to a solid support such as the wall of a microtiter dish 
or to a bead using conventional methodologies. The sample is then mixed with 1) the 
synthetic domain of a thermostable DNA polymerase conjugated to an antibody which 
recognizes either a first antigen or a first epitope on an antigen, and 2) the 5' nuclease 
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domain of a thermostable DNA polymerase conjugated to a second antibody which 
recognizes either a second, distinct antigen or a second epitope on the same antigen as 
recognized by the antibody conjugated to the synthetic domain. Following an 
appropriate period to allow the interaction of the antibodies with their cognate antigens 
(conditions will vary depending upon the antibodies used; appropriate conditions are 
well known in the art), the sample is then washed to remove unbound antibody- 
enzyme domain complexes. dATP, dTTP and a small amount of poly d(A-T) is then 
added to the washed sample and the sample is incubated at elevated temperatures 
(generally in the range of 60-80°C and more preferably, 70-75°C) to permit the 
thermostable synthetic and 5' nuclease domains to function. If the sample contains the 
antigen(s) recognized by both separately conjugated domains of the polymerase, then 
an exponential increase in poly d(A-T) production occurs. If only the antibody 
conjugated to the synthetic domain of the polymerase is present in the sample such 
that no 5' nuclease domain is present in the washed sample, then only an arithmetic 
increase in poly d(A-T) is possible. The reaction conditions may be controlled in such 
a way so that an arithmetic increase in poly d(A-T) is below the threshold of detection. 
This may be accomplished by controlling the length of time the reaction is allowed to 
proceed or by adding so little poly d(A-T) to act as template that in the absence of 
nuclease activity to generate new poly d(A-T) primers very little poly d(A-T) is 
synthesized. 

It is not necessary for both domains of the enzyme to be conjugated to an 
antibody. One can provide the synthetic domain conjugated to an antibody and 
provide the 5' nuclease domain in solution or vice versa. In such a case the 
conjugated antibody-enzyme domain is added to the sample, incubated, then washed. 
dATP, dTTP, poly d(A-T) and the remaining enzyme domain in solution is then 
added. 

Additionally, the two enzyme domains may be conjugated to oligonucleotides 
such that target nucleic acid sequences can be detected. The oligonucleotides 
conjugated to the two different enzyme domains may recognize different regions on 
the same target nucleic acid strand or may recognize two unrelated target nucleic acids. 
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The production of poly d(A-T) may be detected in many ways including: 
1) use of a radioactive label on either the dATP or dTTP supplied for the synthesis of 
the poly d(A-T), followed by size separation of the reaction products and 
autoradiography; 2) use of a fluorescent probe on the dATP and a biotinylated probe 

5 on the dTTP supplied for the synthesis of the poly d(A-T), followed by passage of the 

reaction products over an avidin bead, such as magnetic beads conjugated to avidin; 
the presence of the florescent probe on the avidin-containing bead indicates that poly 
d(A-T) has been formed as the fluorescent probe will stick to the avidin bead only if 
the fluorescenated dATP is incorporated into a covalent linkage with the biotinylated 

10 dTTP; and 3) changes fluorescence polarization indicating an increase in size. Other 

means of detecting the presence of poly d(A-T) include the use of intercalating 
fluorescence indicators to monitor the increase in duplex DNA formation. 

The advantages of the above dual capture assay for detecting antigenic or 

nucleic acid targets include: 

15 i) No thermocycling of the sample is required. The polymerase domains 

and the dATP and dTTP are incubated at a fixed temperature (generally about 70°C). 
After 30 minutes of incubation up to 75% of the added dNTPs are incorporated into 
poly d(A-T). The lack of thermocycling makes this assay well suited to clinical 
laboratory settings; there is no need to purchase a thermocycling apparatus and there is 

20 no need to maintain very precise temperature control. 

2) The reaction conditions are simple. The incubation of the bound 
enzymatic domains is done in a buffer containing 0.5 mM MgCl 2 (higher 
concentrations may be used), 2-10 mM Tris-Cl, pH 8.5, approximately 50 uM dATP 
and dTTP. The reaction volume is 10-20 ul and reaction products are detectable 

25 within 10-20 minutes. 

3) No reaction is detected unless both the synthetic and nuclease activities 
are present. Thus, a positive result indicates that both probes (antibody or 
oligonucleotide) have recognized their targets thereby increasing the specificity of 
recognition by having two different probes bind to the target. 
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The ability to separate the two enzymatic activities of the DNAP allows for 
exponential increases in poly d(A-T) production. If a DNAP is used which lacks 5' 
nuclease activity, such as the Klenow fragment of DNAPEcl, only a linear or 
arithmetic increase in poly d(A-T) production is possible [Setlow et al, J. Biol. Chem. 
5 247:224 (1972)]. The ability to provide an enzyme having 5' nuclease activity but 

lacking synthetic activity is made possible by the disclosure of this invention. 

V. Cleavase™ Fragment Length Polymorphism For The Detection Of 
Secondary Structure 

Nucleic acids assume secondary structures which depend on base-pairing for 

10 stability. When single strands of nucleic acids (single-stranded DNA, denatured DNA 

or RNA) with different sequences, even closely related ones, are allowed to fold on 
themselves, they assume characteristic secondary structures. These differences in 
structures account for the ability of single strand conformation polymorphism (SSCP) 
analysis to distinguish between DNA fragments having closely related sequences. 

15 The 5' nuclease domains of certain DNA polymerases are specific 

endonucleases that recognize and cleave nucleic acids at specific structures rather than 
in a sequence-specific manner (as do restriction endonucleases). The isolated nuclease 
domain of DNAPTaq described herein (termed the Cleavase™ enzyme) recognizes the 
end of a duplex that has non-base paired strands at the ends. The strand with the 5' 

20 end is cleaved at the junction between the single strand and the duplex. 

Figure 29 depicts a wild-type substrate and a mutant substrate wherein the 
mutant substrate differs from the wild-type by a single base change (A to G as 
indicated). According to the method of the present invention, substrate structures form 
when nucleic acids are denatured and allowed to fold on themselves (See Figure 29, 

25 steps 1 and 2). The step of denaturation may be achieved by treating the nucleic acid 

with heat, low (<3) or high pH (>10), the use of low salt concentrations, the absence 
of cations, chemicals {e.g., urea, formamide) or proteins {e.g., helicases). Folding or 
renaturation of the nucleic acid is achieved by lowering of the temperature, addition of 
salt, neutralization of the pH, withdrawal of the chemicals or proteins. 
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The manner in which the substrate folds is dependent upon the sequence of the 
substrate. The 5' nucleases of the invention cleave the structures (See Figure 29, step 
3). The end points of the resulting fragments reflect the locations of the cleavage 
sites. The cleavage itself is dependent upon the formation of a particular structure, not 
upon a particular sequence at the cleavage site. 

When the 5' nucleases of the invention cleave a nucleic acid substrate, a 
collection of cleavage products or fragments is generated. These fragments constitute 
a characteristic fingerprint of the nucleic acid which can be detected [e.g., by 
electrophoresis on a gel (see step 4)]. Changes in the sequence of a nucleic acid (e.g., 
single point mutation between a wild-type and mutant gene) alter the pattern of 
cleavage structures formed. When the 5' nucleases of the invention cleave the 
structures formed by a wild-type and an altered or mutant form of the substrate, the 
distribution of the cleavage fragments generated will differ between the two substrates 
reflecting the difference in the sequence of the two substrates (See Figure 39, step 5). 

The Cleavase™ enzyme generates a unique pattern of cleavage products for a 
substrate nucleic acid. Digestion with the Cleavase™ enzyme can be used to detect 
single base changes in DNA molecules of great length (e.g., 1.6 kb in length) to 
produce a characteristic pattern of cleavage products. The method of the invention is 
termed "Cleavase™ Fragment Length Polymorphism" (CFLP™). However, it is noted 
that the invention is not limited to the use of the Cleavase™ enzyme; suitable 
enzymatic cleavage activity may be provided from a variety of sources including the 
Cleavase™ enzyme, Taq DNA polymerase, E. coli DNA polymerase I and eukaryotic 
structure-specific endonucleases (e.g., the yeast RAD2 protein and RAD1/RAD10 
complex [Harrington, JJ. and Liener (1994) Genes and Develop. 8:1344], murine 
FEN-1 endonucleases (Harrington and Liener, supra) and calf thymus 5' to 3' 
exonuclease [Murante, R.S., et al. (1994) J. Biol. Chem. 269:1191]). Indeed actual 
experimental data is provided herein which demonstrates that numerous enzymes may 
be used to generate a unique pattern of cleavage products for a substrate nucleic acid. 
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Enzymes which are shown herein to be suitable for use in the CFLP™ method include 
the Cleavase™ BN enzyme, Taq DNA polymerase, Tth DNA polymerase, Tfl DNA 
polymerase, E. coli Exo III, and the yeast Radl/RadlO complex. 

The invention demonstrates that numerous enzymes may be suitable for use in 
the CFLP™ method including enzymes which have been characterized in the literature 
a being 3' exonucleases. In order to test whether an enzyme is suitable for use as a 
cleavage means in the CFLP™ method (i.e., capable of generating a unique pattern of 
cleavage products for a substrate nucleic acid), the following steps are taken. Careful 
consideration of the steps described below allows the evaluation of any enzyme 
("enzyme X") for use in the CFLP™ method. 

An initial CFLP™ reaction is prepared using a previously characterized 
substrate nucleic acid [for example the 157 nucleotide fragment of exon 4 of the 
human tyrosinase gene (SEQ ID NO:47)]. The substrate nucleic acid (approximately 
100 fmoles; the nucleic acid template may contain a 5' end or other label to permit 
easy detection of the cleavage products) is placed into a thin wall microcentrifuge tube 
in a solution which comprises reaction conditions reported to be optimal for the 
characterized activity of the enzyme (i.e., enzyme X). For example, if the enzyme X 
is a DNA polymerase, the initial reaction conditions would utilize a buffer which has 
been reported to be optimal for the polymerization activity of the polymerase. If 
enzyme X is not a polymerase, or if no specific components are reported to be needed 
for activity, the initial reaction may be assembled by placing the substrate nucleic acid 
in a solution comprising IX CFLP™ buffer (10 mM MOPS, 0.05% TWEEN-20, 
0.05% NONIDET P-40), pH 7.2 to 8.2, 1 mM MnCl 2 . 

The substrate nucleic acid is denatured by heating the sample tube to 95°C for 
5 seconds and then the reaction is cooled to a temperature suitable for the enzyme 
being tested (e.g., if a thermostable polymerase is being tested the cleavage reaction 
may proceed at elevated temperatures such as 72°C; if a mesophilic enzyme is being 
tested the tube is cooled to 37°C for the cleavage reaction). Following denaturation 
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and cooling to the target temperature, the cleavage reaction is initiated by the addition 
of a solution comprising 1 to 200 units of the enzyme to be tested (i.e., enzyme X; the 
enzyme may be diluted into IX CFLP™ buffer, pH 8.2 if desired). 

Following the addition of the enzyme X solution, the cleavage reaction is 
allowed to proceed at the target temperature for 2 to 5 minutes. The cleavage reaction 
is then terminated [this may be accomplished by the addition of a stop solution (95% 
formamide, 10 mM EDTA, 0.05% bromophenol blue, 0.05% xylene cyanol)] and the 
cleavage products are resolved and detected using any suitable method (e.g., 
electrophoresis on a denaturing polyacrylamide gel followed by transfer to a solid 
support and nonisotopic detection). The cleavage pattern generated is examined by the 
criteria described below for the CFLP™ optimization test. 

An enzyme is suitable for use in the CFLP™ method if it is capable of 
generating a unique (i.e., characteristic) pattern of cleavage products from a substrate 
nucleic acid; this cleavage must be shown to be dependent upon the presence of the 
enzyme. Additionally, an enzyme must be able to reproducibly generate the same 
cleavage pattern when a given substrate is cleaved under the same reaction conditions. 
To test for reproducibility, the enzyme to be evaluated is used in at least two separate 
cleavage reactions run on different occasions using the same reaction conditions. If 
the same cleavage pattern is obtained on both occasions, the enzyme is capable of 
reproducibly generating a cleavage pattern and is therefore suitable for use in the 
CFLP™ method. 

When enzymes derived from mesophilic organisms are to be tested in the 
CFLP™ reaction they may be initially tested at 37°C. However it may be desirable to 
use theses enzymes at higher temperatures in the cleavage reaction. The ability to 
cleave nucleic acid substrates over a range of temperatures is desirable when the 
cleavage reaction is being used to detect sequence variation (i.e., mutation) between 
different substrates. Strong secondary structures that may dominate the cleavage 
pattern are less likely to be destabilized by single-base changes and may therefore 
interfere with mutation detection. Elevated temperatures can then be used to bring 
these persistent structures to the brink of instability, so that the effects of small 
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changes in sequence are maximized and revealed as alterations in the cleavage pattern. 
Mesophilic enzymes may be used at temperatures greater than 37°C under certain 
conditions known to the art. These conditions include the use of high (i.e., 10-30%) 
concentrations of glycerol in the reaction conditions. Furthermore, it is noted that 
5 while an enzyme may be isolated from a mesophilic organism this fact alone does not 

mean that the enzyme may not demonstrate thermostability; therefore when testing the 
suitability of a mesophilic enzyme in the CFLP™ reaction, the reaction should be run 
at 37°C and at higher temperatures. Alternatively, mild denaturants can be used to 
destablize the nucleic acid substrate at a lower temperature (e.g., 1-10% formamide, 1- 
10 10% DMSO and 1-10% glycerol have been used in enzymatic reactions to mimic 

thermal destablization). 

Nucleic acid substrates that may be analyzed using a cleavage means, such as a 
5' nuclease, include many types of both RNA and DNA. Such nucleic acid substrates 
may all be obtained using standard molecular biological techniques. For example, 
15 substrates may be isolated from a tissue sample, tissue culture cells, bacteria or viruses, 

may be transcribed in vitro from a DNA template, or may be chemically synthesized. 
Furthermore, substrates may be isolated from an organism, either as genomic material 
or as a plasmid or similar extrachromosomal DNA, or it may be a fragment of such 
material generated by treatment with a restriction endonuclease or other cleavage 
20 agents or it may be synthetic. 

Substrates may also be produced by amplification using the PCR. When the 
substrate is to be a single- stranded substrate molecule, the substrate may be produced 
using the PCR with preferential amplification of one strand (asymmetric PCR). 
Single-stranded substrates may also be conveniently generated in other ways. For 
25 example, a double- stranded molecule containing a biotin label at the end of one of the 

two strands may be bound to a solid support (e.g., a magnetic bead) linked to a 
streptavidin moiety. The biotin-labeled strand is selectively captured by binding to the 
streptavidin-bead complex. It is noted that the subsequent cleavage reaction may be 
performed using substrate attached to the solid support, as the Cleavase™ enzyme can 
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cleave the substrate while it is bound to the bead. A single- stranded substrate may 
also be produced from a double-stranded molecule by digestion of one strand with 
exonuclease. 

The nucleic acids of interest may contain a label to aid in their detection 
5 following the cleavage reaction. The label may be a radioisotope (e.g., a 32 P or 35 S- 

labeled nucleotide) placed at either the 5' or 3' end of the nucleic acid or alternatively 
the label may be distributed throughout the nucleic acid (i.e., an internally labeled 
substrate). The label may be a nonisotopic detectable moiety, such as a fluorophore 
which can be detected directly, or a reactive group which permits specific recognition 
10 by a secondary agent. For example, biotinylated nucleic acids may be detected by 

probing with a streptavidin molecule which is coupled to an indicator (e.g., alkaline 
phosphatase or a fluorophore), or a hapten such as digoxigenin may be detected using 
a specific antibody coupled to a similar indicator. Alternatively, unlabeled nucleic acid 
may be cleaved and visualized by staining (e.g., ethidium bromide staining) or by 
15 hybridization using a labeled probe. In a preferred embodiment, the substrate nucleic 

acid is labeled at the 5' end with a biotin molecule and is detected using avidin or 
streptavidin coupled to alkaline phosphatase. In another preferred embodiment the 
substrate nucleic acid is labeled at the 5' end with a fluorescein molecule and is 
detected using an anti-fluorescein antibody-alkaline phosphatase conjugate. 
20 The cleavage patterns are essentially partial digests of the substrate in the 

reaction. When the substrate is labelled at one end (e.g., with biotin), all detectable 
fragments share a common end. The extension of the time of incubation of the 
enzyme Cleavase™ reaction does not significantly increase the proportion of short 
fragments, indicating that each potential cleavage site assumes either an active or 
25 inactive conformation and that there is little inter-conversion between the states of any 

potential site, once they have formed. Nevertheless, many of the structures recognized 
as active cleavage sites are likely to be only a few base-pairs long and would appear to 
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be unstable at the elevated temperatures used in the Cleavase™ reaction. The 
formation or disruption of these structures in response to small sequence changes 
results in changes in the patterns of cleavage. 

The products of the cleavage reaction are a collection of fragments generated 

5 by structure specific cleavage of the input nucleic acid. Nucleic acids which differ in 

size may be analyzed and resolved by a number of methods including electrophoresis, 
chromatography, fluorescence polarization, mass spectrometry and chip hybridization. 
The invention is illustrated using electrophoretic separation. However, it is noted that 
the resolution of the cleavage products is not limited to electrophoresis. 

10 Electrophoresis is chosen to illustrate the method of the invention because 

electrophoresis is widely practiced in the art and is easily accessible to the average 
practitioner. 

If abundant quantities of DNA are available for the analysis, it may be 
advantageous to use direct fluorescence to detect the cleavage fragments, raising the 

15 possibility of analyzing several samples in the same tube and on the same gel. This 

"multiplexing" would permit automated comparisons of closely related substrates such 
as wild-type and mutant forms of a gene. 

The CFLP™ reaction is useful to rapidly screen for differences between similar 
nucleic acid molecules. To optimize the CFLP™ reaction for any desired nucleic acid 

20 system (e.g., a wild-type nucleic acid and one or more mutant forms of the wild-type 

nucleic acid), it is most convenient to use a single substrate from the test system (for 
example, the wild-type substrate) to determine the best CFLP™ reaction conditions. A 
single suitable condition is chosen for doing the comparison CFLP™ reactions on the 
other molecules of interest. For example, a cleavage reaction may be optimized for a 

25 wild-type sequence and mutant sequences may subsequently be cleaved under the same 

conditions for comparison with the wild-type pattern. The objective of the CFLP™ 
optimization test is the identification of a set of conditions which allow the test 
molecule to form an assortment (i.e., a population) of intra-strand structures that are 
sufficiently stable such that treatment with a structure-specific cleavage agent such as 
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the Cleavase™ enzyme or DNAPTaq will yield a signature array of cleavage products, 
yet are sufficiently unstable that minor or single-base changes within the test molecule 
are likely to result in a noticeable change in the array of cleavage products. 

The following discussion illustrates the optimization of the CFLP™ method for 
use with a single-stranded substrate. 

A panel of reaction conditions with varying salt concentration and temperature 
is first performed to identify an optimal set of conditions for the single-stranded 
CFLP tm_ -Optimal CFLP™" is defined for this test case as the set of conditions that 
yields the most widely spaced set of bands after electrophoretic separation, with the 
most even signal intensity between the bands. 

Two elements of the cleavage reaction that significantly affect the stability of 
the nucleic acid structures are the temperature at which the cleavage reaction is 
performed and the concentration of salt in the reaction solution. Likewise, other 
factors affecting nucleic acid structures, such as, formamide, urea or extremes in pH 
may be used. The initial test typically will comprise reactions performed at four 
temperatures (60°C, 65°C, 70°C and 75°C) in three different salt concentrations (0 
mM, 25 mM and 50 mM) for a total of twelve individual reactions. It is not intended 
that the present invention be limited by the salt utilized. The salt utilized may be 
chosen from potassium chloride, sodium chloride, etc. with potassium chloride being a 
preferred salt. 

For each salt concentration to be tested, 30 ul of a master mix containing a 
DNA substrate, buffer and salt is prepared. When the substrate is DNA, suitable 
buffers include 3-[N-Morpholino]propanesulfonic acid (MOPS), pH 6.5 to 9.0, with 
P H 7.5 to 8.4 being particularly preferred and other "Good" biological buffers such as 
tris[Hydroxymethyl]aminomethane (Tris) or N,N-bis[2-Hydroxyethyl]glycine (Bicine), 
pH 6.5 to 9.0, with pH 7.5 to 8.4 being particularly preferred. When the nucleic acid 
substrate is RNA, the pH of the buffer is reduced to the range of 6.0 to 8.5, with pH 
6.0 to 7.0 being particularly preferred. When manganese is to used as the divalent 
cation in the reaction, the use of Tris buffers is not preferred. Manganese tends to 
precipitate as manganous oxide in Tris if the divalent cation is exposed to the buffer 
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for prolonged periods (such as in incubations of greater than 5 minutes or in the 
storage of a stock buffer). When manganese is to be used as the divalent cation, a 
preferred buffer is the MOPS buffer. 

For reactions containing no salt (the "0 mM KC1" mix), the mix includes 
enough detectable DNA for 5 digests {e.g., approximately 500 fmoles of 5' 
biotmylated DNA or approximately 100 fmoles of «P-5' end labeled DNA) in 30 p.1 of 
IX CFU>™ buffer (10 mM MOPS, P H 8.2) with 1.7 mM MnCl 2 or MgCl 2 (the final 
concentration of the divalent cation will be 1 mM). Other concentrations of the 
divalent cation may be used if appropriate for the cleavage agent chosen {e.g., E. coli 
DNA polymerase I is commonly used in a buffer containing 5 mM MgCl 2 ). The "25 
mM KC1" mix includes 41.5 mM KC1 in addition to the above components; the "50 
mM KC1" mix includes 83.3 mM KC1 in addition to the above components. 

The mixes are distributed into labeled reaction tubes (0.2 ml, 0.5 ml or 1.5 ml 
"Eppendorf 1 style microcentrifuge tubes) in 6 nl aliquots, overlaid with light mineral 
oil or a similar barrier, and stored on ice until use. Sixty microliters of an enzyme 
dilution cocktail is assembled, comprising a 5' nuclease at a suitable concentration in 
IX CFLP™ buffer without MnCl 2 . Preferred 5' nucleases and concentrations are 750 
ng of the Cleavase™ enzyme BN or 15 units of Tag DNA polymerase (or another 
eubacterial Pol A-type DNA polymerase). Suitable amounts of a similar structure- 
specific cleavage agent in IX CFLP™ buffer without MnCl 2 may also be utilized. 

If a strong {i.e., stable) secondary structure is formed by the substrates, a single 
nucleotide change is unlikely to significantly alter that structure, or the cleavage 
pattern it produces. Elevated temperatures can be used to bring structures to the brink 
of instability, so that the effects of small changes in sequence are maximized, and 
revealed as alterations in the cleavage pattern within the target substrate, thus allowing 
the cleavage reaction to occur at that point. Consequently, it is often desirable to run 
the reaction at an elevated temperature {i.e., above 55°C). 
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Preferably, reactions are performed at 60°C, 65°C, 70°C and 75°C. For each 
temperature to be tested, a trio of tubes at each of the three KC1 concentrations are 
brought to 95°C for 5 seconds, then cooled to the selected temperature. The reactions 
are then started immediately by the addition of 4 ul of the enzyme cocktail. A 

5 duplicate trio of tubes may be included (these tubes receiving 4 ul of IX CFLP™ 

buffer without enzyme or MnCl 2 ), to assess the nucleic add stability in these reaction 
conditions. All reactions proceed for 5 minutes, and are stopped by the addition of 8 
111 of 95% formamide with 20 mM EDTA and 0.05% xylene cyanol and 0.05% 
bromophenol blue. Reactions may be assembled and stored on ice if necessary. 

10 Completed reactions are stored on ice until all reactions in the series have been 

performed. 

Samples are heated to 72°C for 2 minutes and 5 ^ of each reaction is resolved 
by electrophoresis through a suitable gel, such as 6 to 10% polyacrylamide (19:1 
cross-link), with 7M urea, in a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA 
15 for nucleic acids up to approximately 1.5 kb, or native or denaturing agarose gels for 

larger molecules. The nucleic acids may be visualized as described above, by staining, 
autoradiography (for radioisotopes) or by transfer to a nylon or other membrane 
support with subsequent hybridization and/or nonisotopic detection. The patterns 
generated are examined by the criteria described above and a reaction condition is 
20 chosen for the performance of the variant comparison CFLP™s. 

A "no enzyme" control allows the assessment of the stability of the nucleic acid 
substrate under particular reaction conditions. In this instance, the substrate is placed 
in a tube containing all reaction components except the enzyme and treated the same 
as the enzyme-containing reactions. Other control reactions may be run. A wild-type 
25 substrate may be cleaved each time a new mutant substrate is tested. Alternatively, a 

previously characterized mutant may be run in parallel with a substrate suspected of 
containing a different mutation. Previously characterized substrates allow for the 
comparison of the cleavage pattern produced by the new test substrate with a known 
cleavage pattern. In this manner, alterations in the new test substrate may be 
30 identified. 
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When the CFLP™ pattern generated by cleavage of a single- stranded substrate 
contains an overly strong {i.e., intense) band, this indicates the presence of a very 
stable structure. The preferred method for redistributing the signal is to alter the 
reaction conditions to increase structure stability {e.g., lower the temperature of the 
cleavage reaction, raise the monovalent salt concentration); this allows other less stable 
structures to compete more effectively for cleavage. 

When the single- stranded substrate is labelled at one end {e.g., with biotin or 
32P) all detectable fragments share a common end. For short DNA substrates (less 
than 250 nucleotides) the concentration of the enzyme {e.g., Cleavase™ BN enzyme) 
and the length of the incubation have minimal influence on the distribution of signal 
intensity, indicating that the cleavage patterns are not partial digests of a single 
structure assumed by the nucleic acid substrate, but rather are relatively complete 
digests of a collection of stable structures formed by the substrate. With longer DNA 
substrates (greater than 250 nucleotides) there is a greater chance of having multiple 
cleavage sites on each structure, giving apparent overdigestion as indicated by the 
absence of any residual full-length materials. For these DNA substrates, the enzyme 
concentration may be lowered in the cleavage reaction (for example, if 50 ng of the 
Cleavase™ BN enzyme were used initially and overdigestion was apparent, the 
concentration of enzyme may be reduced to 25, 10 or 1 ng per reaction). 

When the CFLP™ reaction is to optimized for the cleavage a double- stranded 
substrate the following steps are taken. The cleavage of double-stranded DNA 
substrates up to 2,000 base pairs may be optimized in this manner. 

The double-stranded substrate is prepared such that it contains a single end- 
label using any of the methods known to the art. The molar amount of DNA used in 
25 the optimization reactions is the same as that use for the optimization of reactions 

utilizing single-stranded substrates. The most notable differences between the 
optimization of the CFLP™ reaction for single- versus double-stranded substrates is 
that the double-stranded substrate is denatured in distilled water without buffer, the 
concentration of MnCl 2 in the reaction is reduced to 0.2 mM, the KC1 (or other 
monovalent salt) is omitted, and the enzyme concentration is reduced to 10 to 25 ng 
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per reaction. In contrast to the optimization of the single-stranded CFLP™ reaction 
(described above) where the variation of the monovalent salt {e.g., KC1) concentration 
is a critical controlling factor, in the optimization of the double-stranded CFLP™ 
reaction the range of temperature is the more critical controlling factor for optimization 
of the reaction. When optimizing the double-stranded CFLP™ reaction a reaction tube 
containing the substrate and other components described below is set up to allow 
performance of the reaction at each of the following temperatures: 40°C, 45°C, 50°C, 

55°C, 60°C, 65°C, 70°C, and 75°C. 

For each temperature to be tested, a mixture comprising the single end labelled 
double-stranded DNA substrate and distilled water in a volume of 15 ul is prepared 
and placed into a thin walled microcentrifuge tube. This mixture may be overlaid with 
light mineral oil or liquid wax (this overlay is not generally required but may provide 
more consistent results with some double-stranded DNA substrates). 

A 2 mM solution of MnCl 2 is prepared. For each CFLP™ reaction, 5 ul of a 
diluted enzyme solution is prepared comprising 2 ul of 10X CFLP™ buffer (100 mM 
MOPS, pH 7.2 to 8.2, 0.5% TWEEN-20, 0.5% NONIDET P-40), 2 ul of 2 mM 
MnCl 2 and 25 ng of Cleavase™ BN enzyme and distilled water to yield a final volume 
of 5 ul. 

The DNA mixture is heated to 95°C for 10 to 30 seconds and then individual 
tubes are cooled to the reaction temperatures to be tested {e.g., 40°C, 45°C, 50°C, 
55°C, 60°C, 65°C, 70°C, and 75°C). The cleavage reaction is started by adding 5 ul 
of the dilute enzyme solution to each tube at the target reaction temperature. The 
reaction is incubated at the target temperature for 5 minutes and the reaction is 
terminated {e.g., by the addition of 16 ul of stop solution comprising 95% formamide 
with 10 mM EDTA and 0.05% xylene cyanol and 0.05% bromophenol blue). 

Samples are heated to 72°C for 1 to 2 minutes and 3 to 7 ul of each reaction is 
resolved by electrophoresis through a suitable gel, such as 6 to 10% polyacrylamide 
(19:1 cross-link), with 7M urea, in a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 mM 
EDTA for nucleic acids up to approximately 1.5 kb, or native or denaturing agarose 
gels for larger molecules. The nucleic acids may be visualized as described above, by 
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staining, autoradiography (for radioisotopes) or by transfer to a nylon or other 
membrane support with subsequent hybridization and/or nonisotopic detection. The 
patterns generated are examined by the criteria described above and a reaction 
condition is chosen for the performance of the double-stranded CFLP™. 

A "no enzyme" control allows the assessment of the stability of the nucleic acid 
substrate under particular reaction conditions. In this instance, the substrate is placed 
in a tube containing all reaction components except the enzyme and treated the same 
as the enzyme-containing reactions. Other control reactions may be run. A wild-type 
substrate may be cleaved each time a new mutant substrate is tested. Alternatively, a 
previously characterized mutant may be run in parallel with a substrate suspected of 
containing a different mutation. Previously characterized substrates allow for the 
comparison of the cleavage pattern produced by the new test substrate with a known 
cleavage pattern. In this manner, alterations in the new test substrate may be 
identified. 

When performing double-stranded CFLP™ reactions the MnCl 2 concentration 
preferably will not exceed 0.25 mM. If the end label on the double-stranded DNA 
substrate disappears (i.e., loses its 5' end label as judged by a loss of signal upon 
detection of the cleavage products), the concentration of MnCl 2 may be reduced to 0.1 
mM. Any EDTA present in the DNA storage buffer will reduce the amount of free 
Mn 2+ in the reaction, so double-stranded DNA should be dissolved in water or Tris- 
HC1 with a EDTA concentration of 0.1 mM or less. 

Cleavage products produced by cleavage of either single-or double-stranded 
substrates which contain a biotin label may be detected using the following nonisotopic 
detection method. After electrophoresis of the reaction products, the gel plates are 
separated allowing the gel to remain flat on one plate. A positively charged nylon 
membrane (preferred membranes include NYTRAN® P lus, 0.2 or 0.45 mm-pore size, 
Schleicher and Schuell, Keene, NH), cut to size and pre-wetted in 0.5X TBE (45 mM 
tris-Borate, pH 8.3, 1.4 mM EDTA), is laid on top of the exposed gel. All air bubbles 
trapped between the gel and the membrane are removed (e.g., by rolling a 10 ml pipet 
firmly across the membrane). Two pieces of 3 MM filter paper (Whatman) are then 
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placed on top of the membrane, the other glass plate is replaced, and the sandwich is 
clamped with binder clips or pressed with books or weights. The transfer is allowed 
to proceed 2 hours to overnight (the signal increases with longer transfer). 

After transfer, the membrane is carefully peeled from the gel and allowed to air 
dry. Distilled water from a squeeze bottle can be used to loosen any gel that sticks to 
the membrane. After complete drying, the membrane is agitated for 30 minutes in 
1.2X Sequenase IMAGES Blocking Buffer (United States Biochemical, Cleveland, 
OH; avoid any precipitates in the blocking buffer by decanting or filtering); 0.3 ml of 
the buffer is used per cm 2 of membrane {e.g., 30 mis for a 10cm x 10cm blot). A 
streptavidin-alkaline phosphatase conjugate (SAAP, United Stated Biochemical) is 
added at a 1:4000 dilution directly to the blocking solution (avoid spotting directly on 
membrane), and agitated for 15 minutes. The membrane is rinsed briefly with dH 2 0 
and then washed 3 times (5 minutes of shaking per/wash) in IX SAAP buffer (100 
mM Tris-HCl, pH 10; 50 mM NaCl) with 0.1% sodium dodecyl sulfate (SDS), using 
0.5 ml buffer/cm 2 of membrane, with brief water rinses between each wash. The 
membrane is then washed twice in IX SAAP buffer (no SDS) with 1 mM MgCl 2 , 
drained thoroughly, and placed in a plastic heat-sealable bag. Using a sterile pipet tip, 
0.05 ml/cm 2 of CDP-Star™ (Tropix, Bedford, MA) is added to the bag and distributed 
over the entire membrane for 5 minutes. The bag is drained of all excess liquid and 
air bubbles, sealed, and the membrane is exposed to X-ray film {e.g., Kodak XRP) for 
30 minutes. Exposure times are adjusted as necessary for resolution and clarity. 

To date, every nucleic acid substrate tested in the CFLP™ system has produced 
a reproducible pattern of fragments. The sensitivity and specificity of the cleavage 
reaction make this method of analysis very suitable for the rapid screening of 
25 mutations in cancer diagnostics, tissue typing, genetic identity, bacterial and viral 

typing, polymorphism analysis, structure analysis, mutant screening in genetic crosses, 
etc. It could also be applied to enhanced RNA analysis, high level multiplexing and 
extension to longer fragments. One distinct benefit of using the Cleavase™ reaction to 
characterize nucleic acids is that the pattern of cleavage products constitutes a 
30 characteristic fingerprint, so a potential mutant can be compared to previously 
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characterized mutants without sequencing. Also, the place in the fragment pattern 
where a change is observed gives a good indication of the position of the mutation. 
But it is noted that the mutation need not be at the precise site of cleavage, but only m 
an area that affects the stability of the structure. 
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VI. Detection of Mutations in the p53 Tumor Suppressor Gene Using the 
CFLP™ Method 

Tumor supressor genes control cellular proliferation and a variety of other 
processes important for tissue homeostasis. One of the most extensively studied of 
these, the P 53 gene, encodes a regulator of the cell cycle machinery that can suppress 
the growth of cancer cells as well as inhibit cell transformation (Levine, Annu. Rev. 
Biochem. 62:623 [1993]). Tumor supressor mutations that alter or obliterate normal 

p53 function are common. 

Mutations in the P 53 tumor supressor gene are found in about half of all cases 
of human cancer making alterations m the P 53 gene the most common cancer-related 
genetic change known at the gene level. In the wild-type or non-mutated form, the 
P 53 gene encodes a 53-kD nuclear phosphoprotein, comprising 393 amino acids, which 
is involved in the control of cellular proliferation. Mutations in the p53 gene are 
generally (greater than 90%) missense mutations which cause a change in the identity 
of an amino acid rather than nonsense mutations which cause inactivation of the 
20 protein. It has been postulated that the high frequency of P 53 mutation seen in human 

tumors is due to the fact that the missense mutations cause both a loss of tumor 
supressor function and a gain of oncogenic function [Lane, D.P. and Benchimol, S., 

Genes Dev. 4:1 (1990)]. 

The gene encoding the P 53 protein is large, spanning 20,000 base pairs, and is 
25 divided into 1 1 exons (see Figure 76). The ability to scan the large P 53 gene for the 

presence of mutations has important clinical applications. In several major human 
cancers the presence of a tumor P 53 mutation is associated with a poor prognosis. P 53 
mutation has been shown to be an independent marker of reduced survival in lymph 
node-negative breast cancers, a finding that may assist clinicians in reaching decisions 
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regarding more aggressive therapeutic treatment. Also, Lowe and co-workers have 
demonstrated that the vulnerability of tumor cells to radiation or chemotherapy is 
greatly reduced by mutations which abolish p53-dependent apoptosis [Lowe et al, Cell 
74:957 (1995)]. 

5 Regions of the P 53 gene from approximately 10,000 tumors have been 

sequenced in the last 4 to 5 years, resulting in characterization of over 3,700 mutations 
of which approximately 1,200 represent independent p53 mutations (i.e., point 
mutations, insertion or deletions). A database has been compiled and deposited with 
the European Molecular Biology Laboratory (EMBL) Data Library and is available in 
10 electronic form [Hollstem, M. et al. (1994) Nucleic Acids Res. 22:3551 and Canello, 

N.F. et al. (1994) Nucleic Acids Res. 22:3549]. In addition, an IBM PC compatible 
software package to analyze the information in the database has been developed. 
[Cariello et al., Nucel. Acids Res. 22:3551 (1994)]. The point mutations in the 
database were identified by DNA sequencing of PCR-amplified products. In most 
1 5 cases, preliminary screening for mutations by SSCP or DGGE was performed. 

Analysis of the P 53 mutations shows that the p53 gene contains 5 hot spot 
regions (HSR) most frequently mutated in human tumors that show a tight correlation 
between domains of the protein that are evolutionary highly conserved (ECDs) and 
seem to be specifically involved in the transformation process (see Figure 76; the 
20 hieght of the bar represent the relative percentage of total mutations associated with 

the five HSRs). The five HSRs are confined to exons 5 to 8 and account for over 
85% of the mutations detected. However, becuase these studies generally confined 
their analysis to PCR amplifications and sequencing of regions located between exons 
5 to 8, it should be kept in mind that mutations outside this region are 
25 underrepresented. As 10% to 15% of the mutations lie outside this region, a clinically 

effective p53 gene DNA diagnostic should be able to cost-effectively scan for life- 
threatening mutations scattered across the entire gene (33). 
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The following table lists a number of the known p53 mutations. 

HUMAN p53 GENE MUTATIONS 

TABLE 2 



GODON NO. 


WILD-TYPE 


MUTANT 




TUMOR TYPE 


36 


CCG 


CCA 


W v_/ — T r\ 1 


Lung 


49 


GAT 


CAT 


— r\_/VJ 


CML 


53 


TGG 


TGT 


p,p v.T A 

UL- — T 1 r\ 


CML 


60 


CCA 


TCA 


vAT 


CML 


68 


GAG 


TAG 


UL — r 1 r\ 


SCLC 


110 


CGT 


TGT 


VjV_y Tl\ 1 


Hepatoca 


113 


TTC 


TGT 


L/OUDlc 1V1 


NSCLC 


128 


CCT 


CCG 


1 — >VJ 


Breast 


128 




TCT 


— T 1 


Breast 


129 


GCC 


GAC 




Neurofibrosa 


130 


CTC 


CTG 


Ljv^ — rLU 


MDS 


132 


AAG . 


AAC 


uL — >^vj 


Colorectal ca 


132 




CAG 


A 1 — 


Breast ca 


132 




AAT 


/T"' vT A 

LiL- — 1 1 A 


T une ("NSCLC) ca 


132 




CAG 


A 1 — ><^*j 


Pancreatic ca 


132 




AGG 


A T vPP 


CML 


1 1 1 


ATG 


TTG 


AT->TA 


Colorectal ca 


133 




AAG 


AT->TA 


Burkitt lymphoma 


134 


TTT 


TTA 


AT->TA 


Lung (SCLC) ca 


135 


TGC 


TAC 


GC->AT 


Colorectal ca 


135 




TCC 


GC->CG 


AML 


135 




TAC 


GC-»AT 


Lung (NSCLC) ca 


135 




TGG 


GC->CG 


MDS 


136 


CAA 


GAG 


Double M 


Breast ca 


138 


GCC 


GTC 


GC-»AT 


Rhabdomyosa 


138 




GGC 


GC->CG 


Lung (SCLC) ca 
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140 



141 
141 



144 



151 



151 



152 



152 



154 



154 



156 



156 



156 



156 



157 



157 



157 



157 



157 



157 



158 



158 



159 



159 



163 



ACC 
TGC 



TAC 
TAC 



AT-»TA 
GC-»AT 



TAC 



GC->AT 



GTG 



GCG 



AT-»GC 



TTG 



GC-VTA 



CAG 



TAG 



GC->AT 



CCG 



AT->CG 



CCC 



CAT 



Double M 



CAC 



GC->TA 



TCC 



GC->AT 



TCC 



GC->AT 



CCG 



CTG 



GCh>AT 



TCG 



GC->AT 



GGC 



GTC 



GC-»TA 



GTC 



GC->TA 



GTC 



GC-»TA 



GTC 



GC->TA 



CGC 



CCC 



GCh>CG 



CCC 



GC->CG 



CGT 



GC->AT 



CCC 



GC->CG 



GTC 



TTC 



GC->TA 



TTC 



GC->TA 



TTC 



GC-»TA 



TTC 



GC-»TA 



TTC 



GC->TA 



TTC 



GC-*TA 



CGC 



CGT 



GC-»AT 



CAC 



GC->AT 



GCC 



GTC 



GC->AT 



TAC 



CCC 
TGC 



GC->CG 
AT->GC 



CML 
Colorectal ca 



Bladder ca 



Colorectal ca 



Lung (NSCLC) ca 



Esophageal ca 



Burkitt lymphoma 



Leiomyosa 



Lung (SCLC) ca 



Glioblastoma 



Lung (NSCLC) ca 



Leiomyosa 



Breast ca 



Esophageal ca 



Lung (NSCLC) ca 



Lung (NSCLC) ca 



Lung (NSCLC) ca 



Rhabdomyosa 



Osteosa 



Lung (NSCLC) ca 



Lung (NSCLC) ca 



Hepatoca 



Lung (SCLC) ca 



Lung (NSCLC) ca 



Breast ca 



Lung (SCLC) ca 



Bladder ca 



Neurofibrosa 



Burkitt lymphoma 



Lung (NSCLC) ca 



Lung (NSCLC) ca 
Breast ca 
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CAC 



AT->GC 



Burkitt lymphoma 



164 



AAG 



CAG 



AT->CG 



Breast ca 



171 

172 



GAG 



TAG 



GC->TA 



GTT 



TTT 



GC-*TA 



Lung (SCLC) ca 



Burkitt lymphoma 



173 



GTG 



TTG 



GC->TA 



Lung (NSCLC) ca' 



173 



TTG 



GC-VTA 



Lung (NSCLC) ca 



173 



GGG 



AT-»CG 



Burkitt lymphoma 



173 



GTA 



GC->AT 



Gastric ca 



CGC 



CAC 



GC-»AT 



Colorectal ad 



175 



CAC 



GC->AT 



Colorectal ad 



CAC 



GC->AT 



Colorectal ad 



175 



CAC 



GC-»AT 



Colorectal ca 



175 



CAC 



GC->AT 



Colorectal ca 



175 



CAC 



GC->AT 



T-ALL 



175 



CAC 



GC-»AT 



Brain tumor 



CAC 



GC-»AT 



Colorectal ca 



CAC 



GC->AT 



Colorectal ca 



175 



CAC 



GC->AT 



Leiomyosa 



175 



CAC 



GC->AT 



Esophageal ca 



175 



CAC 



GC->AT 



Glioblastoma 



175 



CAC 



GC->AT 



Colorectal ca 



175 



CAC 



GC->AT 



T-ALL 



CAC 



GC->AT 



Breast ca 



175 



CTC 



GC->TA 



Breast ca 



175 



AGC 



GC->TA 



Hepatoca 



175 



CAC 



GC->AT 



B-ALL 



175 



CAC 



GC-»AT 



B-ALL 



175 
175 



CAC 



GC->AT 



CAC 



GC->AT 



Burkitt lymphoma 



Burkitt lymphoma 



175 



CAC 



GC->AT 



Burkitt lymphoma 



175 
175 



CAC 



GC-»AT 



Burkitt lymphoma 



CAC 



GC->AT 



Gastric ca 
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10 



15 



20 



25 



30 



176 



176 
176 



176 



177 



179 



179 



179 



179 



179 



179 



181 



187 



192 



193 



193 



193 



194 



194 



194 



194 



194 



196 



196 



196 



196 



196 



198 



198 



202 



204 



205 



TGC 



TTC 



GC->TA 



TTC 



GC->>TA 



TTC 



GC->TA 



TAC 



GC->AT 



CCC 



CGC 



GC->CG 



CAT 



TAT 



GC->AT 



CAG 



AT->CG 



CTT 



AT->TA 



GAT 



GC->CG 



CTT 



AT-+TA 



CTT 



AT->TA 



CGC 



CAC 



GC->AT 



GGT 



TGT 



GC->TA 



CAG 



TAG 



GC->AT 



CAT 



CGT 



AT-»GC 



TAT 



GC->AT 



CGT 



AT->GC 



CTT 



TTT 



GC->AT 



CGT 



AT->CG 



CGT 



AT->CG 



CGT 



AT-»CG 



CGT 



AT-*CG 



CGA 



TGA 



GC->AT 



TGA 



GC->AT 



TGA 



GC->AT 



TGA 



GC->AT 



TGA 



GC->AT 



GAA 



TAA 



GC->TA 



TAA 



GC->TA 



CGT 



CTT 



GC->TA 



GAG 



GGG 



AT->GC 



TAT 



TGT 



AT->GC 



Lung (NSCLC) ca 



Esophageal ca 



Lung (NSCLC) ca 



Burkitt lymphoma 



PTLC 



Neurofibrosa 



Lung (SCLC) ca 



Esophageal ca 



Breast ca 



Cholangiosa 



Cholangiosa 



Li-Fraumeni sdm 



Breast ca 



Esophageal ca 



Lung (SCLC) ca 



Esophageal ca 



AML 



Breast ca 



Lung (SCLC) ca 



Esophageal ca 



Esophageal ca 



B-CLL 



Colorectal ca 



T-ALL 



T-cell lymphoma 



Lung (SCLC) ca 



Bladder ca 



Lung (SCLC) ca 



Lung (SCLC) ca 



CML 



CML 



B-ALL 
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10 



15 



20 



25 



30 



205 



205 



211 



213 



213 



213 



213 



213 



213 



213 



213 



213 



215 



216 



216 



216 



216 



220 



229 



232 



234 



234 



234 



236 



237 



237 



237 



237 



237 



237 
238 



238 



TGT 
TTT 



AT->GC 
AT-»TA 



ACT 



GCT 



AT->GC 



CGA 



TGA 



GC->AT 



CAA 



GC->AT 



CAA 



GC->AT 



CGG 



AT-»GC 



CGG 



AT-»GC 



TGA 



GC->AT 



CGG 



AT->GC 



TGA 



GC->AT 



TGA 



GC->AT 



AGT 



GGT 



AT->GC 



GTG 



ATG 



GC->AT 



GAG 



AT->TA 



TTG 



GC->TA 



ATG 



GC->AT 



TAT 



TGT 



AT->GC 



TGT 



TGA 



AT->TA 



ATC 



AGC 



AT->CG 



TAC 



CAC 



AT^GC 



CAC 



AT^GC 



TGC 



AT->GC 



TAC 



TGC 



AT->GC 



ATG 



AGG 



AT->CG 



ATA 



GC->AT 



ATA 



GC->AT 



ATA 



GC->AT 



ATA 



GC-»AT 



TGT 



ATA 
TTT 



GC->AT 
GC->TA 



TAT 



GC->AT 



B-CLL 
Gastric ca 



Colorectal ca 



Colorectal ca 



B-cell lymphoma 



Burkitt lymphoma 



Lung (SCLC) ca 



Esophageal ca 



Lung (NSCLC) ca 



Lung (NSCLC) ca 



Burkitt lymphoma 



Burkitt lymphoma 



Colorectal ca 



Brain tumor 



Burkitt lymphoma 



Gastric ca 



Ovarian ca 



Colorectal ca 



Lung (SCLC) ca 



B-CLL 



B-cell lymphoma 



Burkitt lymphoma 



Burkitt lymphoma 



Burkitt lymphoma 



T-ALL 



Lung (SCLC) ca 



AML 



Breast ca 



Burkitt lymphoma 



Richter's sdm 



Larynx ca 



Burkitt lymphoma 
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10 



15 



20 
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30 
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10 



15 



20 



25 



30 



247 
248 



248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 
248 



248 
248 
248 



AAC 
CGG 



ATC 
TGG 



TGG 
CAG 
CAG 
CAG 
CAG 
TGG 
TGG 
TGG 
TGG 
TGG 
CTG 
TGG 
CAG 
CTG 
CAG 
TGG 
CTG 
TGG 
CAG 
CAG 
TGG 
CAG 
CAG 
CAG 
TGG 



AT->TA 
GC->AT 



CAG 
TGG 
CAG 



248 
248 



TGG 
CAG 



GC->AT 
GC-»AT 
GC->AT 
GC-^AT 
GC->AT 
GC->AT 
GC->AT 
GC->AT 
GC->AT 
GC->AT 
GC->TA 
GCh>AT 
GC-»AT 
GC->TA 
GC->AT 
GC->AT 
GC-»TA 
GC->AT 
GC-»AT 
GC->AT 
GC->AT 
GC-»AT 
GC->AT 
GC->AT 
GC-^AT 



GC->AT 
GC->AT 
GC-»AT 



GC->AT 
GC-»AT 



L ung (NSCLC) ca 
Colorectal ad 



Colorectal ca 
C olorectal ca 
Co lorectal ca 

T-ALL 
E sophageal ca 
Li-Fraumeni sdm 
L i-Fraumeni sdm 
Colorectal ca 
Col orectal ca 
Rhabdomyosa 
Esophageal ca 
Lung (NSCLC) ca 
Lung (SCLC) ca 
Lun g (SCLC) ca 

T-ALL 
Lung (NSCLC) ca 
Lung (SCLC) ca 
Co lorectal ca 
Bl adder ca 
MPS 
Bur kitt lymphoma 
Breast ca 
B-CLL 
Burkitt lymphoma 
Burkitt lymphoma 



Burkitt lymphoma 
Bu rkitt lymphoma 
Gastric ca 



Lung (SCLC) ca 
Breast ca 
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15 
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30 



248 
248 
248 



248 



249 



249 



249 



249 



249 



249 



249 



249 



249 



249 



249 



249 



249 



249 



249 



250 



251 



252 



252 



254 



254 



256 



258 



258 



258 



259 



260 



266 



CAG 
TGG 



GC->AT 
GC-»AT 



CAG 



GC->AT 



TGG 



GC->AT 



AGG 



AGT 



GC->TA 



AGT 



GC->TA 



AGT 



GC->TA 



AGC 



GC->CG 



AGT 



GC-»TA 



AGT 



GC->TA 



AGT 



GC->TA 



AGT 



GC->TA 



AGT 



GC->TA 



AGT 



GC->TA 



AGT 



GC-VTA 



AGT 



GC-VTA 



AGC 



GC->CG 



AGT 



GC-VTA 



AGT 



GC->TA 



CCC 



CTC 



GC->AT 



ATC 



AGC 



AT->CG 



CTC 



CCC 



AT-^GC 



CTC 



CCC 



AT-*GC 



ATC 



GAC 



Double M 



AAC 



AT->TA 



ACA 



GCA 



AT->GC 



GAA 



AAA 



GC->AT 



AAA 



GC->AT 



AAA 



GC->AT 



GAC 



GGC 



AT->GC 



TCC 



GCC 



AT-)>CG 



GGA 



GTA 



GC-VTA 



CML 
Li-Fraumeni sdm 



Li-Fraumeni sdm 



Colorectal ca 



Hepatoca 



Hepatoca 



Hepatoca 



Hepatoca 



Hepatoca 



Hepatoca 



Hepatoca 



Hepatoca 



Hepatoca 



Hepatoca 



Hepatoca 



Esophageal ca 



Breast ca 



Lung (NSCLC) ca 



Hepatoca 



Burkitt lymphoma 



Gastric ca 



Li-Fraumeni sdm 



Li-Fraumeni sdm 



Burkitt lymphoma 



Breast ca 



T-ALL 



Li-Fraumeni sdm 



Burkitt lymphoma 



Li-Fraumeni sdm 



T-ALL 



T-ALL 



Lung (NSCLC) ca 
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266 



GTA 



GC->TA 



Lung (NSCLC) ca 



266 
267 



GTA 



GC-VTA 



CGG 



CCG 



GC->CG 



Breast ca 



Lung (SCLC) ca 



270 



TTT 



TGT 



AT-»CG 



Esophageal ca 



270 



TGT 



AT-*CG 



T-ALL 



272 



GTG 



ATG 



GC->AT 



Brain tumor 



272 



CTG 



GC->CG 



Lung (SCLC) ca 



272 



ATG 



GC->AT 



Hepatoca 



272 



ATG 



GC->AT 



AML 



273 



CGT 



TGT 



GC->AT 



Colorectal ad 



273 



TGT 



GC-»AT 



Brain tumor 



273 



CAT 



GC->AT 



Breast ca 



273 



CAT 



GC->AT 



Colorectal ca 



273 



TGT 



GC->AT 



Lung (NSCLC) ca 



273 



CTT 



GC->TA 



Lung (SCLC) ca 



273 



CAT 



GC->AT 



Colorectal ca 



273 



CAT 



GC->AT 



Colorectal ca 



273 



CAT 



GC->AT 



Colorectal ca 



273 



CAT 



GC->AT 



Lung (NSCLC) ca 



273 



CCT 



GC-»CG 



Lung (NSCLC) ca 



273 



CTT 



GC->TA 



Lung (NSCLC) ca 



273 



CTT 



GC->TA 



Lung (NSCLC) ca 



273 



CAT 



GC->AT 



Thyroid ca 



273 



CAT 



GC->AT 



Lung (SCLC) ca 



273 



TGT 



GC-»AT 



B-cell lymphoma 



273 



TGT 



GC->AT 



B-ALL 



273 



TGT 



GC->AT 



Burkitt lymphoma 



273 



TGT 



GC-»AT 



Burkitt lymphoma 



273 



CAT 



GC->AT 



Li-Fraumeni sdm 



273 
273 



273 



TGT 
TGT 



CAT 



GC->AT 
GC-»AT 
GC->AT 



Cervical ca 
AML 



B->CLL 
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« 



25 



30 



273 



274 



276 



276 



277 



278 



278 



278 



278 



278 



280 



280 



281 



281 



281 



281 



282 



282 



282 



282 



282 



282 



282 



282 



282 



282 



282 



282 



282 



283 



283 



285 



GTT 



CTT 
GAT 



GC->TA 
AT-»TA 



GCC 



CCC 



GC->CG 



GAC 



GC->TA 



TGT 



TTT 



GC->TA 



CCT 



TCT 



GC->AT 



CTT 



GC->AT 



GCT 



GC-+CG 



TCT 



GC-»AT 



CGT 



GC->CG 



AGA 



AAA 



GC->AT 



AAA 



GC->AT 



GAC 



GGC 



AT->GC 



GGC 



AT-»GC 



GAC 



GAG 



GC->-CG 



TAC 



GC->TA 



CGG 



TGG 



GC-»AT 



TGG 



GC->AT 



CGG 



TGG 



GC-»AT 



GGG 



GC-*CG 



CCG 



GC->CG 



TGG 



GC^AT 



TGG 



GC^AT 



CTG 



GC-»TA 



TGG 



GC-^AT 



TGG 



GC->AT 



TGG 



GC->AT 



TGG 



GC-^AT 



TGG 



GC->AT 



CGC 



TGC 



GC-»AT 



GAG 



CCC 
AAG 



GC->CG 
GC->AT 



B-CLL 
Erythroleukemia 



B-ALL 



Hepatoca 



Lung (SCLC) ca 



Esophageal ca 



Esophageal ca 



Breast ca 



Lung (SCLC) ca 



Ovarian ca 



Esophageal ca 



Breast ca 



Colorectal ca 



Breast ca 



Richter's sdm 



B-CLL 



Colorectal ad 



Colorectal ca 



Rhabdomyosa 



Lung (NSCLC) ca 



Breast ca 



Bladder ca 



AML 



Breast ca 



B-ALL 



Burkitt lymphoma 



Richter's sdm 



Ovarian ca 



Li-Fraumeni sdm 



Colorectal ca 



Lung (NSCLC) ca 
Breast ca 
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10 



286 
286 



286 
287 



293 
298 



309 



GAA 



AAA_ 
GGA 



GC->AT 
AT-»GC 



GCA 



AT->CG 



GAG 



TAG 



GC->TA 



GGG 
GAG 



TGG 
TAG 



GC->TA 



GC->TA 



GGG 



GGT 



GC->TA 



AAG 



TAG 



AT->TA 



TAG 



AT->TA 



GCA 



ACA 



GC->AT 



CCC 



TCC 



GC->AT 



GC->TA 



Lung (SCLC) ca 



Li-Fraumeni sdm 



Burkitt lymphoma 



Glioblastoma 



Bladder ca 



Lung (SCLC) ca 



Esophageal ca 



Esophageal ca 



Breast ca 



Colorectal ca 



Lung (SCLC) ca 
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DELETIONS/INSERTIONS 



CODON 



137 



EVEMT 



del 7 



TUMOR TYPE 



Gastric ca 



143 



del 1 



Gastric ca 



152 



del 13 



Colorectal ad 



167 



del 1 



Breast ca 



168 



del 31 



Hepatoca 



175 



del 18 



Breast ca 



190 



del 3 



nul ALL 



201 



del 1 



Breast ca 



206 



del 1 



Burkitt lymphoma 



206 



del 1 



Burkitt lymphoma 



214 



del 1 



B-ALL 



236 



del 27 



Bladder ca 



239 



del 1 



Lung (NSCLC) ca 



262 



del 1 



Astrocytoma 



262 



del 24 



Gastric ca 



262 



del 24 



Lung CNSCLC) ca 



263 



del 1 



Esophageal ca 



264 



del 1 



AML 



286 



del 8 



Hepatoca 



293 



307 



del 1 



del 1 



Lung (NSCLC) ca 



Li-Fraumeni sdm 



381 



del 1 



Hepatoca 



Exon 5 



del 15 



B-ALL 



152 



239 



ins 1 



ins 1 



B-CLL 



W aldenstrom sdm 



252 



ins 4 



Gastric ca 



256 



ins 1 



AML 



275 



ins 1 



B-CLL 



301 



ins 1 



MDS 



307 
Exon 8 



ins 1 
ins 25 



Glioblastoma 
HCL 
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SPLICE MUTATIONS 



-5 



© 



10 



15 



20 



25 



INTRON 


SITE 


EVENT 


TUMOR TYPE 


Intron 3 


Accept 


GC-+CG 


Lung (SCLC) ca 


Intron 4 


Donor 


GC-*TA 


Lung (SCLC) ca 


Intron 4 


Donor 


GC->AT 


T->ALL 


Intron 5 


Donor 


GC->AT 


CML 


Intron 6 


Donor 


ATh>CG 


Lung (SCLC) ca 


Intron 6 


Accept 


AT-^TA 


Lung (SCLC) ca 


Intron 6 


Accept 


AT— >TA 


Lung (NSCLC) ca 


Intron 7 


Donor 


GC— >TA 


Lung (NSCLC) ca 


Intron 7 


Accept 


GC->CG 


Lung (SCLC) ca 


Intron 7 


Accept 


CG->AT 


AML 


Intron 7 


Donor 


GC-»TA 


Lung (SCLC) ca 


Intron 9 


Donor 


GC->TA 


Lung (SCLC) ca 



A. CFLP™ Analysis of p53 Mutations in Clinical Samples 

To permit the identiifcation of mutations in the p53 gene from clinical samples, 
nucleic acid comprising p53 gene sequences are prepared. The nucleic acid may 
comprise genomic DNA, RNA or cDNA forms of the P 53 gene. Nucleic acid may be 
extracted from a variety of clinical samples [fresh or frozen tissue, suspensions of cells 
(e.g., blood), cerebral spinal fluid, sputum, etc.] using a variety of standard techniques 
or commerically available kits. For example, kits which allow the isolation of RNA or 
DNA from tissue samples are available from Qiagen, Inc. (Chatsworth, CA) and 
Stratagene (LaJolla, CA), respectively. Total RNA may be isolated from tissues and 
tumors by a number of methods known to those skilled in the art and commercial kits 
are available to facilitate the isolation. For example, the RNEASY® kit (Qiagen Inc., 
Chatsworth, CA) provides protocol, reagents and plasticware to permit the isolation of 
total RNA from tissues, cultured cells or bacteria, with no modification to the 
manufacturer's instructions, in approximately 20 minutes. Should it be desirable, in 
the case of eukaryotic RNA isolates, to further enrich for messenger RNAs, the 
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polyadenylated RNAs in the mixture may be specifically isolated by binding to an 
oligo-deoxythymidine matrix, through the use of a kit such as the OLIGOTEX® kit 
(Qiagen). Comparable isolation kits for both of these steps are available through a 
number of commercial suppliers. 

5 In addition, RNA may be extracted from samples, including biopsy specimens, 

convienently by lysing the homogenized tissue in a buffer containing 0.22 M NaCl, 
0.75 mM MgCl 2 , 0.1 M Tris-HCl, pH 8.0, 12.5 raM EDTA, 0.25% NP40, 1% SDS, 
0.5 mM DTT, 500 u/ml placental RNAse inhibitor and 200 ng/ml Proteinase K. 
Following incubation at 37°C for 30 min, the RNA is extracted with 

10 phenolxhloroform (1:1) and the RNA is recovered by ethanol precipitation. 

Since the majority of p53 mutations are found within exons 5-8, it is 
convenient as a first analysis to examine a PCR fragment spanning this region. PCR 
fragments spanning exons 5- 8 may be amplified from clinical samples using the 
technique of RT-PCR (reverse transcription-PCR); kits which permit the user to start 

15 with tissue and produce a PCR product are available from Perkin Elmer (Norwalk, 

CT) and Stratagene (LaJolla, CA). The RT-PCR technique generates a single- stranded 
cDNA corresponding to a chosen segment of the coding region of a gene by using 
reverse transcription of RNA; the single-stranded cDNA is then used as template in the 
PCR. In the case of the p53 gene, an approximately 600 bp fragment spanning exons 

20 5-8 is generated using primers located in the coding region immediately adjacent to 

exons 5 and 8 in the RT-PCR. The PCR amplified segment is then subjected to the 
CFLP reaction and the reaction products are analysed as described above in section 
VIII. 

Fragments suitable for CFLP analysis may also be generated by PCR 
25 amplification of genomic DNA. DNA is extracted from a sample and primers 

corresponding to sequences present in introns 4 and 8 are used to amplify a segment of 
the p53 gene spanning exons 5-8 which includes introns 5-7 (an approximately 2 kb 
fragment). If it is desirable to use smaller fragments of DNA in the CFLP reaction, 
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primers may be chosen to amplify smaller (1 kb or less) segments of genomic DNA or 
alternatively a large PCR fragment may be divided into two or more smaller frgaments 

using resstriction enzymes. 

In order to facilitate the identification of p53 mutations in the clinical setting, a 
5 library containing the CFLP pattern produced by previously characterized mutations 

may be provided. Comparision of the pattern generated using nucleic acid derived 
from a clinical sample with the patterns produced by cleavage of known and 
characterized P 53 mutations will allow the rapid identification of the specific p53 
mutation present in the patient's tissue. The comparison of CFLP patterns from 
10 clinical samples to the patterns present in the library may be accomplished by a variety 

of means. The simplest and least expensive comparision involves visual comparision. 
Given the large number of unique mutations known at the p53 locus, visual (i.e., 
manual) comparision may be too time-consuming, especially when large numbers of 
clinical isolates are to be screened. Therefore the CFLP patterns or "bar codes" may 
15 be provided in an electronic format for ease and efficiency in comparision. Electronic 

entry may comprise storage of scans of gels containing the CFLP products of the 
reference P 53 mutations (using for example, the GENEREADER and GEL DOCTOR 
Fluorescence Gel documentation system (BioRad, Hercules, CA) or the 
IMAGEMASTER (Pharmacia Biotech, Piscataway, NJ). Alternatively, as the detection 
20 of cleavage patterns may be automated using DNA sequencing instrumentation (see 

Example 20), the banding pattern may be stored as the signal collected from the 
appropriate channels during an automated run [examples of instrumentation suitable for 
such analysis and data collection include fluorescence-based gel imagers such as 
fluoroimagers produced by Molecular Dynamics and Hitachi or by real-time 
25 electrophoresis detection systems such as the ABI 377 or Pharmacia ALF DNA 

Sequencer]. 
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B. Generation of a Library of Characterized p53 Mutations 

The generation of a library of characterized mutations will enable clinical 
samples to be rapidly and directly screened for the presence of the most common p53 
mutations. Comparision of CFLP patterns generated from clinical samples to the p53 
5 bar code library will establish both the presence of a mutation in the p53 gene and its 

precise identity without the necessity of costly and time consuming DNA sequence 
analysis. 

The p53 bar code library is generated using reverse genetics. Engineering of 
p53 mutations ensures the identity and purity of each of the mutations as each 

10 engineered mutation is confirmed by DNA sequencing. The individual p53 mutations 

in p53 bar code library are generated using the 2-step "recombinant PCR" technique 
[Higuchi, R. (1991) In Ehrlich, H.A. (Ed.), PCR Technology: Principles and 
Applications for DNA Amplification, Stockton Press, New York, pp. 61-70 and 
Nelson, R.M. and Long, G.L. (1989) Analytical Biochem. 180:147]. Figure 77 

15 provides a schematic representation of one method of a 2-step recombinant PCR 

technique that may be used for the generation of p53 mutations. 

The template for the PCR amplifications is the entire human p53 cDNA gene. 
In the first of the two PCRs (designated "PCR 1" in Fig. 77), an oligonucleotide 
containing the engineered mutation ("oligo A" in Fig. 77) and an oligonucleotide 

20 containing a 5' arm of approximately 20 non-complementary bases ("oligo B") are 

used to amplify a relatively small region of the target DNA (100-200 bp). The 
resulting amplification product will contain the mutation at its extreme 5' end and a 
foreign sequence at its 3' end. The 3' sequence is designed to include a unique 
restriction site (e.g., Eco RI) to aid in the directional cloning of the final amplification 

25 fragment (important for purposes of sequencing and archiving the DNA containing the 

mutation). The product generated in the upstream or first PCR may be gel purified if 
desired prior to the use of this first PCR product in the second PCR; however gel 
purification is not required once it is established that this fragment is the only species 
amplified in the PCR. 
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The small PCR fragment containing the engineered mutation is then used to 
direct a second round of PCR (PCR 2). In PCR 2, the target DNA is a larger 
fragment (approximately 1 kb) of the same subcloned region of the p53 cDNA. 
Because the sequence at the 3' end of the small PCR fragment is not complementary 
5 to any of the sequences present in the target DNA, only that strand in which the 

mismatch is at the extreme 5' end is amplified in PCR 2 (a 3' non-templated arm 
cannot be extended in PCR). Amplification is accomplished by the addition of a 
primer complementary to a region of the target DNA upstream of the locus of the 
engineered mutation ("oligo C") and by the addition of a primer complementary to the 
10 5' noncomplementary sequence of the small product of PCR 1 ("oligo D"). By 

directing amplification from the noncomplementary sequence, this procedure results in 
the specific amplification of only those sequences containing the mutation. In order to 
facilitate cloning of these PCR products into a standard vector, a second unique 
restriction site can be engineered into oligo C (e.g., Hindlll). 
! 5 The use of this 2-step PCR approach requires that only one primer be 

synthesized for each mutant to be generated after the initial set-up of the system (i.e., 
oligo A). Oligos B, C and D can be used for all mutations generated within a given 
region. Because oligos C and D are designed to include different and unique 
restriction sites, subsequent directional cloning of these PCR products into plasmid 
20 vectors (such as pUC 19) is greatly simplified. Selective amplification of only those 

sequences that include the desired mutational change simplifies identification of 
mutation-containing clones as only verification of the sequence of insert containing 
plasmids is required. Once the sequence of the insert has been verified, each 
mutation-containing clone may be maintained indefinitely as a bacterial master stock. 
25 In addition, DNA stocks of each mutant can be maintained in the form of large scale 

PCR preparations. This permits distribution of either bactera harboring plasmids 
containing a given mutation or a PCR preparation to be distributed as individual 
controls in kits containing reagents for the scanning of p53 mutations in clinical 
samples or as part of a supplemental master p53 mutation library control kit. 
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An alternative 2-step recombinant PCR is diagrammed in Figure 78, and 
described in Example 32. In this method two mutagenic oligonucleotides, one for 
each strand, are synthesized. These oligonucleotides are substantially complementary 
to each other but are opposite in orientation.. That is, one is positioned to allow 
amplification of an "upstream" region of the DNA, with the mutation incorporated into 
the 3' proximal region of the upper, or sense strand, while the other is positioned to 
allow amplification of a "downstream" segment with the intended mutation 
incorporated into the 5' proximal region of the upper, or sense strand. These two 
double stranded products share the sequence provided by these mutagenic 
oligonucleotides. When purified, combined, denatured and annealed, those strands that 
anneal with recessed 3' ends can be extended or filled in by the action of DNA 
polymerase, thus recreating a full length molecules with the mutation in the central 
region. This recombinant can be amplified by the use of the "outer" primer pair,those 
used to make the 5' end of the "upstream" and the 3' end of the "downstream" 
intermediate fragments. 

While extra care must be taken with this method (in comparison with the 
method described above) because the outer primers can amplify both the recombinant 
and the un-modified sequence, this method does allow rapid recombinant PCR to be 
performed using existing end primers, and without the introduction of foreign 
sequences. In summary, this method is often used if only a few recombinations are to 
be performed. When large volumes of mutagenic PCRs are to be performed, the first 
described method is preferable as the first method requires a single oligo be 
synthesized for each mutagenesis and only recombinants are amplified. 

An important feature of kits designed for the identifcation of p53 mutations in 
clinical samples is the inclusion of the specific primers to be used for generating PCR 
fragments to be analyzed for CFLP. While DNA fragments from 100 to over 1500 bp 
can be reproducibly and accurately analyzed for the presence of sequence 
polymorphisms by this technique, the precise patterns generated from different length 
fragments of the same input DNA sequence will of course vary. Not only are patterns 
shifted relative to one another depending on the length of the input DNA, but in some 
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transfer of DNA onto an expensive membrane support, fluorescence-based detection 
methods may be preferred. It is important to note, however, that any of the above 
methods may be used to generate CFLP bar codes to be input into the database. 

In addition to their being a direct, non-isotopic means of detecting CFLP 
patterns, fluorescence-based schemes offer a noteworthy additional advantage in 
clinical applications. CFLP allows the analysis of several samples in the same tube 
and in the same lane on a gel. This "multiplexing" permits rapid and automated 
comparison of a large number of samples in a fraction of the time and for a lower cost 
than can be realized through individual analysis of each sample. This approach opens 
the door to several alternative applications. A researcher could decide to double, triple 
or quadruple (up to 4 dyes have been demonstrated to be detectable and compatible in 
a single lane in commerically available DNA sequencing instrumentation such as the 
ABI 373/377) the number of samples run on a given gel. Alternatively, the analyst 
may include a normal p53 gene sample in each tube, and each gel lane, along with a 
differentially labeled size standard, as a internal standard to verify both the presence 
and the exact location(s) of a pattern difference(s) between the normal p53 gene and 
putative mutants. 

IV. Detection and Identification of Pathogens Using the CFLP™ Method 
A. Detection and Identification of Hepatitis C Virus 

Hepatitis C virus (HCV) infection is the predominant cause of post-transfusion 
non-A, non-B (NANB) hepatitis around the world. In addition, HCV is the major 
etiologic agent of hepatocellular carcinoma (HCC) and chronic liver disease world 
wide. HCV infection is transmitted primarily to blood transfusion recipients and 
intravenous drug users although maternal transmission to offspring and transmission to 
recipients of organ transplants have been reported. 

The genome of the positive-stranded RNA hepatitis C virus comprises several 
regions including 5' and 3' noncoding regions {i.e., 5' and 3' untranslated regions) 
and a polyprotein coding region which encodes the core protein (C), two envelope 
glycoproteins (El and E2/NS1) and six nonstructural glycoproteins (NS2-NS5b). 

- 91 - 



Molecular biological analysis of the small (9.4 kb) RNA genome has showed that 
some regions of the genome are very highly conserved between isolates, while other 
regions are fairly rapidly changeable. The 5' noncoding region (NCR) is the most 
highly conserved region in the HCV. These analyses have allowed these viruses to be 
5 divided into six basic genotype groups, and then further classified into over a dozen 

sub-types [the nomenclature and division of HCV genotypes is evolving; see 
Altamirano et al, J. Infect. Dis. 171:1034 (1995) for a recent classification scheme]. 
These viral groups are associated with different geographical areas, and accurate 
identification of the agent in outbreaks is important in monitoring the disease. While 
10 only Group 1 HCV has been observed in the United States, multiple HCV genotypes 

have been observed in both Europe and Japan. 

The ability to determine the genotype of viral isolates also allows comparisons 
of the clinical outcomes from infection by the different types of HCV, and from 
infection by multiple types in a single individual. HCV type has also been associated 
15 with differential efficacy of treatment with interferon, with Group 1 infected 

individuals showing little response [Kanai et al, Lancet 339:1543 (1992) and 
Yoshioka et al, Hepatology 16:293 (1992)]. Pre-screening of infected individuals for 
the viral type will allow the clinician to make a more accurate diagnosis, and to avoid 
costly but fruitless drug treatment. 
2 0 Existing methods for determining the genotype of HCV isolates include PCR 

amplification of segments of the HCV genome coupled with either DNA sequencing or 
hybridization to HCV-specific probes, RFLP analysis of PCR amplified HCV DNA 
anything else?. All of these methods suffer from the limitations discussed above (i.e., 
DNA sequencing is too labor-intensive and expensive to be practical in clinical 
25 laboratory settings; RFLP analysis suffers from low sensitivity). 

Universal and genotype specific primers have been designed for the 
amplification of HCV sequences from RNA extracted from plasma or serum [Okamoto 
et al J. Gen. Virol. 73:673 (1992); Yoshioka et al, Hepatology 16:293 (1992) and 
Altamirano et al, supra]. These primers can be used to generate PCR products which 
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serve as substrates in the CFLP™ assay of the present invention. As shown herein 
CFLP™ analysis provides a rapid and accurate method of typing HCV isolates. 
CFLP™ analysis of HCV substrates allows a distinction to be made between the major 
genotypes and subtypes of HCV thus providing improved methods for the genotyping 
of HCV isolates. 

B. Detection and Identification of Multi-Drug Resistant M. tuberculosis 

In the past decade there has been a tremendous resurgence in the incidence of 
tuberculosis in this country and throughout the world. In the United States, the 
incidence of tuberculosis has risen steadily during past decade, accounting for 2000 
deaths annually, with as many as 10 million Americans infected with the disease. The 
situation is critical in New York City, where the incidence has more than doubled in 
the past decade, accounting for 14% of all new cases in the United States in 1990 
[Frieden et al, New Engl. J. Med. 328:521 (1993)]. 

The crisis in New York City is particularly dire because a significant proportion 
(as many as one-third) of the recent cases are resistant to one or more antituberculosis 
drugs [Frieden et al, supra and Hughes, Scrip Magazine May (1994)]. Multi-drug 
resistant tuberculosis (MDR-TB) is an iatrogenic disease that arises from incomplete 
treatment of a primary infection [Jacobs, Jr., Clin. Infect. Dis. 19:1 (1994)]. MDR-TB 
appears to pose an especially serious risk to the immunocompromised, who are more 
likely to be infected with MDR-TB strains than are otherwise healthy individuals 
[Jacobs, Jr., supra]. The mortality rate of MDR-TB in immunocompromised 
individuals is alarmingly high, often exceeding 90%, compared to a mortality rate of 
<50% in otherwise uncompromised individuals [Donnabella et al, Am. J. Respir. Dis. 
11:639 (1994)]. 

From a clinical standpoint, tuberculosis has always been difficult to diagnose 
because of the extremely long generation time of Mycobacterium tuberculosis as well 
as the environmental prevalence of other, faster growing mycobacterial species. The 
doubling time of M. tuberculosis is 20-24 hours, and growth by conventional methods 
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typically requires 4 to 6 weeks to positively identify M. tuberculosis [Jacobs, Jr. et ah, 
Science 260:819 (1993) and Shinnick and Jones in Tuberculosis: Pathogenesis, 
Protection and Control, Bloom, ed., American Society of Microbiology, Washington, 
D.C. (1994), pp. 517-530]. It can take an additional 3 to 6 weeks to diagnose the drug 
5 susceptibility of a given strain [Shinnick and Jones, supra]. Needless to say, the health 

risks to the infected individual, as well as to the public, during a protracted period in 
which the patient may or may not be symptomatic, but is almost certainly contagious, 
are considerable. Once a drug resistance profile has been elucidated and a diagnosis 
made, treatment of a single patient can cost up to $250,000 and require 24 months. 

10 The recent explosion int he incidence of the disease, together with the dire risks 

posed by MDR strains, have combined to spur a burst of research activity and 
commercial development of procedures and products aimed at accelerating the 
detection of M. tuberculosis as well the elucidation of drug resistance profiles of M. 
tuberculosis clinical isolates. A number of these methods are devoted primarily to the 

1 5 task of determining whether a given strain is M. tuberculosis or a mycobacterial 

species other than tuberculosis. Both culture based methods and nucleic-acid based 
methods have been developed that allow M. tuberculosis to be positively identified 
more rapidly than by classical methods: detection times have been reduced from 
greater than 6 weeks to as little as two weeks (culture-based methods) or two days 

20 (nucleic acid-based methods). While culture-based methods are currently in wide- 

spread use in clinical laboratories, a number of rapid nucleic acid-based methods that 
can be applied directly to clinical samples are under development. For all of the 
techniques described below, it is necessary to first "decontaminate" the clinical 
samples, such as sputum (usually done by pretreatment with N- acetyl L-cysteine and 

25 NaOH) to reduce contamination by non-mycobacterial species [Shinnick and Jones, 

supra.] 

The polymerase chain reaction (PCR) has been applied to the detection of M. 
tuberculosis and can be used to detect its presence directly from clinical specimens 
within one to two days. The more sensitive techniques rely on a two-step procedure: 
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the first step is the PCR amplification itself, the second is an analytical step such as 
hybridization of the amplicon to a M. tuberculosis-specific oligonucleotide probe, or 
analysis by RFLP or DNA sequencing [Shinnick and Jones, supra]. 

The Amplified M. tuberculosis Direct Test (AMTDT; Gen-Probe) relies on 
5 Transcription Mediated Amplification [TMA; essentially a self-sustained sequence 

reaction (3SR) amplification] to amplify target rRNA sequences directly from clinical 
specimens. Once the rRNA has been amplified, it is then detected by a dye-labeled 
assay such as the PACE2. This assay is highly subject to inhibition by substances 
present in clinical samples. 

10 The Cycling Probe Reaction (CPR; ID Biomedical). This technique, which is 

under development as a diagnostic tool for detecting the presence of M. tuberculosis, 
measures the accumulation of signal probe molecules. The signal amplification is 
accomplished by hybridizing tripartite DNA-RNA-DNA probes to target nucleic acids, 
such as M. tuberculosis-specific sequences. Upon the addition of RNAse H, the RNA 

1 5 portion of the chimeric probe is degraded, releasing the DNA portions, which 

accumulate linearly over time to indicate that the target sequence is present [Yule, 
Bio/Technology 12:1335 (1994)]. The need to use of RNA probes is a drawback, 
particularly for use in crude clinical samples, where RNase contamination is often 
rampant. 

20 The above nucleic acid-based detection and differentiation methods offer a clear 

time savings over the more traditional, culture-based methods. While they are 
beginning to enter the clinical setting, their usefulness in the routine diagnosis of M. 
tuberculosis is still in question, in large part because of problems with associated with 
cross-contamination and low-sensitivity relative to culture-based methods. In addition, 

25 many of these procedures are limited to analysis of respiratory specimens [Yule, 

Bio/Technology 12:1335 (1994)]. 

ii) Determination of the antibiotic resistance profile of M. tuberculosis 
a) Culture-based methods: Once a positive identification of M. 
tuberculosis has been made, it is necessary to characterize the extent and nature of the 
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strain's resistance to antibiotics. The traditional method used to determine antibiotic 
resistance is the direct proportion agar dilution method, in which dilutions of culture 
are plated on media containing antibiotics and on control media without antibiotics. 
This method typically adds an additional 2-6 weeks to the time required for diagnosis 

5 and characterization of an unknown clinical sample [Jacobs, Jr., supra]. 

The Luciferase Reporter Mycobacteriophage (LRM) assay was first described in 
1993 [Jacobs, Jr. et al, Science 260:819 (1993)]. In this assay, a mycobacteriophage 
containing a cloned copy of the luciferase gene is used to infect mycobacterial 
cultures. In the presence of luciferin and ATP, the expressed luciferase produces 

10 photons, easily distinguishable by eye or by a luminometer, allowing a precise 

determination of the extent of mycobacterial growth in the presence of antibiotics. 
Once sufficient culture has been obtained (usually 10-14 days post-inoculation), the 
assay can be completed in 2 days. This method suffers from the fact that the LRM are 
not specific for M. tuberculosis: they also infect M. smegmatis and M. bovis {e.g., 

15 BCG), thereby complicating the interpretation of positive results. Discrimination 

between the two species must be accomplished by growth on specialized media which 
does not support the growth of M. tuberculosis {e.g., NAP media). This confirmation 

requires another 2 to 4 days. 

The above culture-based methods for determining antibiotic resistance will 

20 continue to play a role in assessing the effectiveness of putative new 

anti-mycobacterial agents and those drugs for which a genetic target has not yet been 
identified. However, recent success in elucidating the molecular basis for resistance to 
a number of anti-mycobacterial agents, including many of the front-line drugs, has 
made possible the use of much faster, more accurate and more informative DNA 

25 polymorphism-based assays. 

b) DNA-based methods: Genetic loci involved in resistance to 
isoniazid, rifampin, streptomycin, fluoroquinolones, and ethionamide have been 
identified [Jacobs, Jr., supra; Heym et al, Lancet 344:293 (1994) and Morris et al, J. 
Infect. Dis. 171:954 (1995)]. A combination of isoniazid (inh) and rifampin (rif) along 
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with pyrazinamide and ethambutol or streptomycin, is routinely used as the first line of 
attack against confirmed cases of M. tuberculosis [Banerjee et al, Science 263:227 
(1994)]. Consequently, resistance to one or more of these drugs can have disastrous 
implications for short course chemotherapy treatment. The increasing incidence of 
5 such resistant strains necessitates the development of rapid assays to detect them and 

thereby reduce the expense and community health hazards of pursuing ineffective, and 
possibly detrimental, treatments. The identification of some of the genetic loci 
involved in drug resistance has facilitated the adoption of mutation detection 
technologies for rapid screening of nucleotide changes that result in drug resistance. 

10 The availability of amplification procedures such as PCR and SDA, which have been 

successful in replicating large amounts of target DNA directly from clinical specimens, 
makes DNA-based approaches to antibiotic profiling far more rapid than conventional, 
culture-based methods. 

The most widely employed techniques in the genetic identification of mutations 

15 leading to drug resistance are DNA sequencing, Restriction Fragment Length 

Polymorphism (RFLP), PCR-Single Stranded Conformational Polymorphism 
(PCR-SSCP), and PCR-dideoxyfmgerprinting (PCR-ddF). All of these techniques have 
drawbacks as discussed above. None of them offers a rapid, reproducible means of 
precisely and uniquely identifying individual alleles. 

20 In contrast the CFLP™ method of the present invetion provides an approach 

that relies on structure specific cleavage to generate distinct collections of DNA 
fragments. This method is highly sensitive (>98%) in its ability to detect sequence 
polymorphisms, and requires a fraction of the time, skill and expense of the techniques 
described above. 
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The application of the CFLP™ method to the detection of MDR-TB is 
illustrated herein using segments of DNA amplified from the rpoB and katG genes. 
Other genes associated with MDR-TB, including but not limited to those involved in 
conferring resistance to isoniazid (inhA), streptomycin (rpsL and rrs), and 
5 fluoroquinoline (gyrA), are equally well suited to the CFLP™ assay. 

C. Detection and Identification of Bacterial Pathogens 

Identification and typing of bacterial pathogens is critical in the clinical 
management of infectious diseases. Precise identity of a microbe is used not only to 
differentiate a disease state from a healthy state, but is also fundamental to 

10 determining whether and which antibiotics or other antimicrobial therapies are most 

suitable for treatment. Traditional methods of pathogen typing have used a variety of 
phenotypic features, including growth characteristics, color, cell or colony morphology, 
antibiotic susceptibility, staining, smell and reactivity with specific antibodies to 
identify bacteria. All of these methods require culture of the suspected pathogen, 

1 5 which suffers from a number of serious shortcomings, including high material and 

labor costs, danger of worker exposure, false positives due to mishandling and false 
negatives due to low numbers of viable cells or due to the fastidious culture 
requirements of many pathogens. In addition, culture methods require a relatively long 
time to achieve diagnosis, and because of the potentially life-threatening nature of such 

20 infections, antimicrobial therapy is often started before the results can be obtained. In 

many cases the pathogens are very similar to the organisms that make up the normal 
flora, and may be indistinguishable from the innocuous strains by the methods cited 
above. In these cases, determinion of the presence of the pathogenic strain may 
require the higher resolution afforded by more recently developed molecular typing 

25 methods. 

A number of methods of examining the genetic material from organisms of 
interest have been developed. One way of performing this type of analysis is by 
hybridization of species-specific nucleic acid probes to the DNA or RNA from the 
organism to be tested. This may be done by immobilizing the denatured nucleic acid 
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to be tested on a membrane support, and probing with labeled nucleic acids that will 
bind only in the presence of the DNA or RNA from the pathogen. In this way, 
pathogens can be identified. Organisms can be further diffrentiated by using the RFLP 
method described above, in which the genomic DNA is digested with one or more 
5 restriction enzymes before electrophoretic separation and transfer to a nitrocellulose or 

nylon membrane support. Probing with the species-specific nucleic acid probes will 
reveal a banding pattern that, if it shows variation between isolates, can be used as a 
reproducible way of discriminating between strains. However, these methods are 
susceptible to the drawbacks outlined above: hybridization-based assays are 

1 0 time-consuming and may give false or misleading results if the stringency of the 

hybridization is not well controlled, and RFLP identification is dependent on the 
presence of suitable restriction sites in the DNA to be analyzed. 

To address these concerns about hybridization and RFLP as diagnostic tools, 
several methods of molecular analysis based on polymerase chain reaction (PCR) 

15 amplification have gained popularity. In one well-accepted method, called PCR 

fingerprinting, the size of a fragment generated by PCR is used as an identifier. In 
this type of assay, the primers are targeted to regions containing variable numbers of 
tandem repeated sequences (referred to as VNTRs an eukaryotes). The number of 
repeats, and thus the length of the PCR amplicon, can be characteristic of a given 

20 pathogen, and co-amplification of several of these loci in a single reaction can create 

specific and reproducible fingerprints, allowing discrimination between closely related 
species. 

In some cases where organisms are very closely related, however, the target of 
the amplification does not display a size difference, and the amplified segment must 

25 be further probed to achieve more precise identification. This may be done on a solid 

support, in a fashion analogous to the whole-genome hybridization described above, 
but this has the same problem with variable stringency as that assay. Alternatively, 
the interior of the PCR fragment may be used as a template for a sequence- specific 
ligation event. As outlined above for the LCR, in this method, single stranded probes 

30 to be ligated are positioned along the sequence of interest on either side of an 
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identifying polymorphism, so that the success or failure of the ligation will indicate the 
presence or absence of a specifice nucleotide sequence at that site. With either 
hybridization or ligation methods of PCR product analysis, knowledge of the precise 
sequence in the area of probe binding must be obtained in advance, and differences 
5 outside the probe binding area are not detected. These methods are poorly suited to 

the examination and typing of new isolates that have not been fully characterized. 

In the methods of the present invention, primers that recognize conserved 
regions of bacterial ribosomal RNA genes allow amplification of segments of these 
genes that include sites of variation. The variations in ribosomal gene sequences have 

10 become an accepted method not only of differenting between similar organisms on a 

DNA sequence level, but their consistant rate of change allows these sequences to be 
used to evaluate the evolutionary relatedness of orgnaisms. That is to say, the more 
similar the nucleic acid is at the sequence level, the more closely related the 
organisms in discussion are considered to be. [Woese, Bacterial Evolution. 

15 Microbiological Reviews, vol 51, No. 2. 1987]. The present invention allows the 

amplification products derived from these sequences to be used to create highly 
individual barcodes {i.e., cleavage patterns), allowing the detection of sequence 
polymorphisms without prior knowledge of the site, character or even the presence of 
said polymorphisms. With appropriate selection of primers, amplification can be 

20 made to be either all-inclusive {e.g., using the most highly conserved ribosomal 

sequences) to allow comparison of distantly related organisms, or the primers can be 
chosen to be very specific for a given genus, to allow examination at the species and 
subspecies level. While the examination of ribosomal genes is extremely useful in 
these characterizations, the use of the CFLP™ method in bacterial typing is not limited 

25 to these genes. Other genes, including but not limited to those associated with specifc 

growth characterisics, {e.g., carbon source preference, antibiotic resistance, resistance 
to methycillin or antigen production), or with particular cell morphologies (such as 
pilus formation) are equally well suited to the CFLP™ assay. 
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D. Extraction of Nucleic Acids From Clinical Samples 

To provide nucleic acid substrates for use in the detection and identification of 
microorganisms in clinical samples using the CFLP™ assay, nucleic acid is extracted 
from the sample. The nucleic acid may be extracted from a variety of clinical samples 
[fresh or frozen tissue, suspensions of cells (e.g., blood), cerebral spinal fluid, sputum, 
urine, etc.] using a variety of standard techniques or commercially available kits. For 
example, kits which allow the isolation of RNA or DNA from tissue samples are 
available from Qiagen, Inc. (Chatsworth, CA) and Stratagene (LaJolla, CA). For 
example, the QIAAMP Blood kits permit the isolation of DNA from blood (fresh, 
frozen or dried) as well as bone marrow, body fluids or cell suspensions. QIAAMP 
tissue kits permit the isolation of DNA from tissues such as muscles, organs and 
tumors. 

It has been found that crude extracts from relatively homogenous specimens 
(such as blood, bacterial colonies, viral plaques, or cerebral spinal fluid) are better 
suited to severing as templates for the amplification of unique PCR products than are 
more composite specimens (such as urine, sputum or feces;) [Shibata in PCR: The 
Polymerase Chain Reaction, Mullis et al, eds., Birkhauser, Boston (1994), pp. 47-54]. 
Samples which contain relatively few copies of the material to be amplified (i.e., the 
target nucleic acid), such as cerebral spinal fluid, can be added directly to a PCR. 
Blood samples have posed a special problem in PCRs due to the inhibitory properties 
of red blood cells. The red blood cells must be removed prior to the use of blood in a 
PCR; there are both classical and commercially available methods for this purpose 
[e.g., QIAAMP Blood kits, passage through a CHELEX 100 column (BioRad), etc.]. 
Extraction of nucleic acid from sputum, the specimen of choice for the direct detection 
of M. tuberculosis, requires prior decontamination to kill or inhibit the growth of other 
bacterial species. This decontamination is typically accomplished by treatment of the 
sample with N-acetyl L-cysteine and NaOH (Shinnick and Jones, supra). This 
decontamination process is necessary only when the sputum specimen is to be cultured 
prior to analysis. 
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EXPERIMENTAL 

The following examples serve to illustrate certain preferred embodiments and 
aspects of the present invention and are not to be construed as limiting the scope 
thereof. 

5 In the disclosure which follows, the following abbreviations apply:°C (degrees 

Centigrade); g (gravitational field); vol (volume); w/v (weight to volume); v/v (volume 
to volume); BSA (bovine serum albumin); CTAB (cetyltrimethylammonium bromide); 
HPLC (high pressure liquid chromatography); DNA (deoxyribonucleic acid); IV S 
(intervening sequence); p (plasmid); [il (microliters); ml (milliliters); (j,g (micrograms); 

10 pmoles (picomoles); mg (milligrams); MOPS (3-[N-Morpholino]propanesulfonic acid); 

M (molar); mM (milliMolar); \xM (microMolar); nm (nanometers); kdal (kilodaltons); 
OD (optical density); EDTA (ethylene diamine tetra- acetic acid); FITC (fluorescein 
isothiocyanate); SDS (sodium dodecyl sulfate); NaP0 4 (sodium phosphate); Tris 
(tris(hydroxymethyl)-aminomethane); PMSF (phenylmethylsulfonylfluoride); TBE 

15 (Tris-Borate-EDTA, i.e., Tris buffer titrated with boric acid rather than HC1 and 

containing EDTA) ; PBS (phosphate buffered saline); PPBS (phosphate buffered saline 
containing 1 mM PMSF); PAGE (polyacrylamide gel electrophoresis); Tween 
(polyoxyethylene-sorbitan); Boehringer Mannheim (Boehringer Mannheim, 
Indianapolis, IN); Dynal (Dynal A.S., Oslo, Norway); Epicentre (Epicentre 

20 Technologies, Madison, WI); National Biosciences (National Biosciences, Plymouth, 

MN); New England Biolabs (New England Biolabs, Beverly, MA); Novagen 
(Novagen, Inc., Madison, WI); Perkin Elmer (Perkin Elmer, Norwalk, CT); Promega 
Corp. (Promega Corp., Madison, WI); RJ Research (RJ Research, Inc., Watertown, 
MA); Stratagene (Stratagene Cloning Systems, La Jolla, CA); USB (U.S. Biochemical, 

25 Cleveland, OH). 
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EXAMPLE 1 

Characteristics Of Native Thermostable DNA Polymerases 
A. 5' Nuclease Activity Of DNAPTa^ 

During the polymerase chain reaction (PCR) [Saiki et al, Science 239:487 
(1988); Mullis and Faloona, Methods in Enzymology 155:335 (1987)], DNAPTaq is 
able to amplify many, but not all, DNA sequences. One sequence that cannot be 
amplified using DNAP7a<7 is shown in Figure 6 (Hairpin structure is SEQ ID NO: 15, 
PRIMERS are SEQ ID NOS: 16-17.) This DNA sequence has the distinguishing 
characteristic of being able to fold on itself to form a hairpin with two single-stranded 
arms, which correspond to the primers used in PCR. 

To test whether this failure to amplify is due to the 5' nuclease activity of the 
enzyme, we compared the abilities of DNAPTaq and DNAPStf to amplify this DNA 
sequence during 30 cycles of PCR. Synthetic oligonucleotides were obtained from 
The Biotechnology Center at the University of Wisconsin-Madison. The DNAP7a# 
and DNAPStf were from Perkin Elmer (i.e., AmpliTaq™ DNA polymerase and the 
Stoffel fragment of Amplitaq™ DNA polymerase). The substrate DNA comprised the 
hairpin structure shown in Figure 6 cloned in a double-stranded form into pUC19. 
The primers used in the amplification are listed as SEQ ID NOS: 16-17. Primer SEQ 
ID NO: 17 is shown annealed to the 3' arm of the hairpin structure in Fig. 6. Primer 
SEQ ID NO: 16 is shown as the first 20 nucleotides in bold on the 5' arm of the 
hairpin in Fig. 6. 

Polymerase chain reactions comprised 1 ng of supercoiled plasmid target DNA, 
5 pmoles of each primer, 40 uM each dNTP, and 2.5 units of DNAPTag or DNAPStf, 
in a 50 ul solution of 10 mM Tris'Cl pH 8.3. The DNAPJbg reactions included 50 
mM KC1 and 1.5 mM MgCl 2 . The temperature profile was 95°C for 30 sec, 55°C for 
1 min. and 72°C for 1 min., through 30 cycles. Ten percent of each reaction was 
analyzed by gel electrophoresis through 6% polyacrylamide (cross- linked 29:1) in a 
buffer of 45 mM Tris'Borate, pH 8.3, 1.4 mM EDTA. 
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The results are shown in Figure 7. The expected product was made by 
DNAPStf (indicated simply as "S") but not by DNAPTaq (indicated as "T"). We 
conclude that the 5' nuclease activity of DNAPTaq is responsible for the lack of 
amplification of this DNA sequence. 

To test whether the 5' unpaired nucleotides in the substrate region of this 
structured DNA are removed by DNAPTaq, the fate of the end-labeled 5' arm during 
four cycles of PCR was compared using the same two polymerases (Figure. 8). The 
hairpin templates, such as the one described in Figure 6, were made using DNAPStf 
and a 32 P-5 '-end-labeled primer. The 5 '-end of the DNA was released as a few large 
fragments by DNAPTaq but not by DNAPStf. The sizes of these fragments (based on 
their mobilities) show that they contain most or all of the unpaired 5' arm of the 
DNA. Thus, cleavage occurs at or near the base of the bifurcated duplex. These 
released fragments terminate with 3' OH groups, as evidenced by direct sequence 
analysis, and the abilities of the fragments to be extended by terminal deoxynucleotidyl 
transferase. 

Fi gures 9-11 show the results of experiments designed to characterize the 
cleavage reaction catalyzed by DNAPTaq. Unless otherwise specified, the cleavage 
reactions comprised 0.01 pmoles of heat-denatured, end-labeled hairpin DNA (with the 
unlabeled complementary strand also present), 1 pmole primer (complementary to the 
3' arm) and 0.5 units of DNAPTaq (estimated to be 0.026 pmoles) in a total volume 
of 10p.l of 10 mM Tris-Cl, ph 8.5, 50 mM KC1 and 1.5 mM MgCl 2 . As indicated, 
some reactions had different concentrations of KC1, and the precise times and 
temperatures used in each experiment are indicated in the individual figures. The 
reactions that included a primer used the one shown in Figure 6 (SEQ ID NO: 17). In 
some instances, the primer was extended to the junction site by providing polymerase 
and selected nucleotides. 

Reactions were initiated at the final reaction temperature by the addition of 
either the MgCl 2 or enzyme. Reactions were stopped at their incubation temperatures 
by the addition of 8 ul of 95% formamide with 20 mM EDTA and 0.05% marker 
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dyes. The T m calculations listed were made using the Oligo™ primer analysis 
software from National Biosciences, Inc. These were determined using 0.25 uM as the 
DNA concentration, at either 15 or 65 mM total salt (the 1.5 mM MgCl 2 in all 
reactions was given the value of 15 mM salt for these calculations). 

Figure 9 is an autoradiogram containing the results of a set of experiments and 
conditions on the cleavage site. Figure 9 A is a determination of reaction components 
that enable cleavage. Incubation of 5 '-end-labeled hairpin DNA was for 30 minutes at 
55°C, with the indicated components. The products were resolved by denaturing 
polyacrylamide gel electrophoresis and the lengths of the products, in nucleotides, are 
indicated. Figure 9B describes the effect of temperature on the site of cleavage in the 
absence of added primer. Reactions were incubated in the absence of KC1 for 10 
minutes at the indicated temperatures. The lengths of the products, in nucleotides, are 
indicated. 

Surprisingly, cleavage by DNAPTaq requires neither a primer nor dNTPs (see 
Fig. 9A). Thus, the 5' nuclease activity can be uncoupled from polymerization. 
Nuclease activity requires magnesium ions, though manganese ions can be substituted, 
albeit with potential changes in specificity and activity. Neither zinc nor calcium ions 
support the cleavage reaction. The reaction occurs over a broad temperature range, 
from 25°C to 85°C, with the rate of cleavage increasing at higher temperatures. 

Still referring to Figure 9, the primer is not elongated in the absence of added 
dNTPs. However, the primer influences both the site and the rate of cleavage of the 
hairpin. The change in the site of cleavage (Fig. 9A) apparently results from 
disruption of a short duplex formed between the arms of the DNA substrate. In the 
absence of primer, the sequences indicated by underlining in Figure 6 could pair, 
forming an extended duplex. Cleavage at the end of the extended duplex would 
release the 1 1 nucleotide fragment seen on the Fig. 9A lanes with no added primer. 
Addition of excess primer (Fig. 9A, lanes 3 and 4) or incubation at an elevated 
temperature (Fig. 9B) disrupts the short extension of the duplex and results in a longer 
5' arm and, hence, longer cleavage products. 
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The location of the 3' end of the primer can influence the precise site of 
cleavage. Electrophoretic analysis revealed that in the absence of primer (Fig. 9B), 
cleavage occurs at the end of the substrate duplex (either the extended or shortened 
form, depending on the temperature) between the first and second base pairs. When 
the primer extends up to the base of the duplex, cleavage also occurs one nucleotide 
into the duplex. However, when a gap of four or six nucleotides exists between the 3 ' 
end of the primer and the substrate duplex, the cleavage site is shifted four to six 
nucleotides in the 5' direction. 

Fig. 10 describes the kinetics of cleavage in the presence (Fig. 10A) or absence 
(Fig. 10B) of a primer oligonucleotide. The reactions were run at 55°C with either 50 
mM KC1 (Fig. 10A) or 20 mM KC1 (Fig. 10B). The reaction products were resolved 
by denaturing poly aery lamide gel electrophoresis and the lengths of the products, in 
nucleotides, are indicated. "M", indicating a marker, is a 5' end-labeled 19-nt 
oligonucleotide. Under these salt conditions, Figs. 10A and 10B indicate that the 
reaction appears to be about twenty times faster in the presence of primer than in the 
absence of primer. This effect on the efficiency may be attributable to proper 
alignment and stabilization of the enzyme on the substrate. 

The relative influence of primer on cleavage rates becomes much greater when 
both reactions are run in 50 mM KC1. In the presence of primer, the rate of cleavage 
increases with KC1 concentration, up to about 50 mM. However, inhibition of this 
reaction in the presence of primer is apparent at 100 mM and is complete at 150 mM 
KC1. In contrast, in the absence of primer the rate is enhanced by concentration of 
KC1 up to 20 mM, but it is reduced at concentrations above 30 mM. At 50 mM KC1, 
the reaction is almost completely inhibited. The inhibition of cleavage by KC1 in the 
absence of primer is affected by temperature, being more pronounced at lower 
temperatures. 

Recognition of the 5' end of the arm to be cut appears to be an important 
feature of substrate recognition. Substrates that lack a free 5' end, such as circular 
Ml 3 DNA, cannot be cleaved under any conditions tested. Even with substrates 
having defined 5' arms, the rate of cleavage by DN APTaq is influenced by the length 
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of the arm. In the presence of primer and 50 raM KC1, cleavage of a 5' extension that 
is 27 nucleotides long is essentially complete within 2 minutes at 55°C. In contrast, 
cleavages of molecules with 5' arms of 84 and 188 nucleotides are only about 90% 
and 40% complete after 20 minutes. Incubation at higher temperatures reduces the 
inhibitory effects of long extensions indicating that secondary structure in the 5' arm 
or a heat-labile structure in the enzyme may inhibit the reaction. A mixing 
experiment, run under conditions of substrate excess, shows that the molecules with 
long arms do not preferentially tie up the available enzyme in non-productive 
complexes. These results may indicate that the 5' nuclease domain gains access to the 
cleavage site at the end of the bifurcated duplex by moving down the 5' arm from one 
end to the other. Longer 5' arms would be expected to have more adventitious 
secondary structures (particularly when KC1 concentrations are high), which would be 
likely to impede this movement. 

Cleavage does not appear to be inhibited by long 3' arms of either the substrate 
strand target molecule or pilot nucleic acid, at least up to 2 kilobases. At the other 
extreme, 3' arms of the pilot nucleic acid as short as one nucleotide can support 
cleavage in a primer-independent reaction, albeit inefficiently. Fully paired 
oligonucleotides do not elicit cleavage of DNA templates during primer extension. 

The ability of DNAPTb^ to cleave molecules even when the complementary 
strand contains only one unpaired 3' nucleotide may be useful in optimizing allele- 
specific PCR. PCR primers that have unpaired 3' ends could act as pilot 
oligonucleotides to direct selective cleavage of unwanted templates during 
preincubation of potential template-primer complexes with DNAPTaq in the absence of 
nucleoside triphosphates. 

B. 5' Nuclease Activities Of Other DNAPs 

To determine whether other 5' nucleases in other DNAPs would be suitable for 
the present invention, an array of enzymes, several of which were reported in the 
literature to be free of apparent 5' nuclease activity, were examined. The ability of 
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these other enzymes to cleave nucleic acids in a structure-specific manner was tested 
using the hairpin substrate shown in Fig. 6 under conditions reported to be optimal for 
synthesis by each enzyme. 

DNAPEcl and DNAP Klenow were obtained from Promega Corporation; the 

5 DNAP of Pyrococcus furious ["Pfu", Bargseid et al, Strategies 4:34 (1991)] was from 

Stratagene; the DNAP of Thermococcus litoralis ["Tli", Vent™(exo-), Perler et al, 
Proc. Natl. Acad. Sci. USA 89:5577 (1992)] was from New England Biolabs; the 
DNAP of Thermus flavus ['TfT, Kaledin et al, Biokhimiya 46:1576 (1981)] was from 
Epicentre Technologies; and the DNAP of Thermus thermophilus ["Tth", Carballeira et 

10 al, Biotechniques 9:276 (1990); Myers et al, Biochem. 30:7661 (1991)] was from 

U.S. Biochemicals. 

0.5 units of each DNA polymerase was assayed in a 20 \i\ reaction, using either 
the buffers supplied by the manufacturers for the primer-dependent reactions, or 
10 mM Tris-Cl, pH 8.5, 1.5 mM MgCl 2 , and 20mM KC1. Reaction mixtures were at 

15 held 72°C before the addition of enzyme. 

Figure 11 is an autoradiogram recording the results of these tests. Figure 11A 
demonstrates reactions of endonucleases of DNAPs of several thermophilic bacteria. 
The reactions were incubated at 55°C for 10 minutes in the presence of primer or at 
72°C for 30 minutes in the absence of primer, and the products were resolved by 

20 denaturing polyacrylamide gel electrophoresis. The lengths of the products, in 

nucleotides, are indicated. Figure 11B demonstrates endonucleolytic cleavage by the 
5' nuclease of DNAPEcl. The DNAPEcl and DNAP Klenow reactions were incubated 
for 5 minutes at 37°C. Note the light band of cleavage products of 25 and 11 
nucleotides in the DNAPEcl lanes (made in the presence and absence of primer, 

25 respectively). Figure 7B also demonstrates DNAP 7a reactions in the presence (+) or 

absence (-) of primer. These reactions were run in 50 mM and 20 mM KC1, 
respectively, and were incubated at 55°C for 10 minutes. 
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Referring to Figure 11 A, DNAPs from the eubacteria Thermus thermophilus 
and Thermus flavus cleave the substrate at the same place as DNAP Tag, both in the 
presence and absence of primer. In contrast, DNAPs from the archaebacteria 
Pyococcus furiosus and Thermococcus litoralis are unable to cleave the substrates 
endonucleolytically. The DNAPs from Pyrococcus furious and Thermococcus litoralis 
share little sequence homology with eubactenal enzymes (Ito et al, Nucl. Acids Res. 
19:4045 (1991); Mathur et al., Nucl. Acids. Res. 19:6952 (1991); see also Perler 
et al). Referring to Figure 11B, DNAPEcl also cleaves the substrate, but the resulting 
cleavage products are difficult to detect unless the 3' exonuclease is inhibited. The 
amino acid sequences of the 5' nuclease domains of DNAPEcl and DNAP Taq are 
about 38% homologous (Gelfand, supra). 

The 5' nuclease domain of DKAPTaq also shares about 19% homology with 
the 5' exonuclease encoded by gene 6 of bacteriophage T7 [Dunn et al., J. Mol. Biol. 
166:477 (1983)]. This nuclease, which is not covalently attached to a DNAP 
polymerization domain, is also able to cleave DNA endonucleolytically, at a site 
similar or identical to the site that is cut by the 5' nucleases described above, in the 
absence of added primers. 



C. Transcleavage 

The ability of a 5' nuclease to be directed to cleave efficiently at any specific 
20 sequence was demonstrated in the following experiment. A partially complementary 

oligonucleotide termed a "pilot oligonucleotide" was hybridized to sequences at the 
desired point of cleavage. The non-complementary part of the pilot oligonucleotide 
provided a structure analogous to the 3' arm of the template (see Figure 6), whereas 
the 5' region of the substrate strand became the 5' arm. A primer was provided by 
25 designing the 3' region of the pilot so that it would fold on itself creating a short 

hairpin with a stabilizing tetra-loop [Antao et al, Nucl. Acids Res. 19:5901 (1991)]. 
Two pilot oligonucleotides are shown in Figure 12A. Oligonucleotides 19-12 (SEQ ID 
NO:18) and 30-12 (SEQ ID NO: 19) are 31 or 42 or nucleotides long, respectively. 
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However, oligonucleotides 19-12 (SEQ ID NO:18) and 34-19 (SEQ ID NO: 19) have 
only 19 and 30 nucleotides, respectively, that are complementary to different sequences 
in the substrate strand. The pilot oligonucleotides are calculated to melt off their 
complements at about 50°C (19-12) and about 75°C (30-12). Both pilots have 12 
nucleotides at their 3' ends, which act as 3' arms with base-paired primers attached. 

To demonstrate that cleavage could be directed by a pilot oligonucleotide, we 
incubated a single-stranded target DNA with DNAPTbg in the presence of two 
potential pilot oligonucleotides. The transcleavage reactions, where the target and pilot 
nucleic acids are not covalently linked, includes 0.01 pmoles of single end-labeled 
substrate DNA, 1 unit of TMAPTaq and 5 pmoles of pilot oligonucleotide in a volume 
of 20 H l of the same buffers. These components were combined during a one minute 
incubation at 95°C, to denature the PCR-generated double-stranded substrate DNA, and 
the temperatures of the reactions were then reduced to their final incubation 
temperatures. Oligonucleotides 30-12 and 19-12 can hybridize to regions of the 
15 substrate DNAs that are 85 and 27 nucleotides from the 5' end of the targeted strand. 

Figure 21 shows the complete 206-mer sequence (SEQ ID NO:32). The 206- 
mer was generated by PCR . The M13/pUC 24-mer reverse sequencing (-48) primer 
and the M13/pUC sequencing (-47) primer from New England Biolabs (catalogue nos. 
1233 and 1224 respectively) were used (50 pmoles each) with the pGEM3z(f+) 
20 plasmid vector (Promega Corp.) as template (10 ng) containing the target sequences. 

The conditions for PCR were as follows: 50 uM of each dNTP and 2.5 units of Taq 
DNA polymerase in 100 ul of 20 mM Tris-Cl, pH 8.3, 1.5 mM MgCl 2 , 50 mM KC1 
with 0.05% Tween-20 and 0.05% NP-40. Reactions were cycled 35 times through 
95°C for 45 seconds, 63°C for 45 seconds, then 72°C for 75 seconds. After cycling, 
25 reactions were finished off with an incubation at 72°C for 5 minutes. The resulting 

fragment was purified by electrophoresis through a 6% polyacrylamide gel (29:1 cross 
link) in a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA, visualized by 
ethidium bromide staining or autoradiography, excised from the gel, eluted by passive 
diffusion, and concentrated by ethanol precipitation. 
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Cleavage of the substrate DNA occurred in the presence of the pilot 
oligonucleotide 19-12 at 50°C (Figure 12B, lanes 1 and 7) but not at 75°C (lanes 4 
and 10). In the presence of oligonucleotide 30-12 cleavage was observed at both 
temperatures. Cleavage did not occur in the absence of added oligonucleotides 

5 (lanes 3, 6 and 12) or at about 80°C even though at 50°C adventitious structures in the 

substrate allowed primer- independent cleavage in the absence of KC1 (Figure 12B, 
lane 9). A non-specific oligonucleotide with no complementarity to the substrate DNA 
did not direct cleavage at 50°C, either in the absence or presence of 50 raM KC1 
(lanes 13 and 14). Thus, the specificity of the cleavage reactions can be controlled by 

10 the extent of complementarity to the substrate and by the conditions of incubation. 

D. Cleavage Of RNA 

An shortened RNA version of the sequence used in the transcleavage 
experiments discussed above was tested for its ability to serve as a substrate in the 
reaction. The RNA is cleaved at the expected place, in a reaction that is dependent 
15 upon the presence of the pilot oligonucleotide. The RNA substrate, made by T7 RNA 

polymerase in the presence of [ot- 32 P]UTP, corresponds to a truncated version of the 
DNA substrate used in Figure 12B. Reaction conditions were similar to those in used 
for the DNA substrates described above, with 50 mM KC1; incubation was for 40 
minutes at 55°C. The pilot oligonucleotide used is termed 30-0 (SEQ ID NO:20) and 

20 is shown in Figure 13 A. 

The results of the cleavage reaction is shown in Figure 13B. The reaction was 
run either in the presence or absence of DNAPTaa or pilot oligonucleotide as indicated 
in Figure 13B. 

Strikingly, in the case of RNA cleavage, a 3' arm is not required for the pilot 
25 oligonucleotide. It is very unlikely that this cleavage is due to previously described 

RNaseH, which would be expected to cut the RNA in several places along the 30 
base-pair long RNA-DNA duplex. The 5' nuclease of DNAP Jag is a structure- 
specific RNaseH that cleaves the RNA at a single site near the 5' end of the 
heteroduplexed region. 
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It is surprising that an oligonucleotide lacking a 3' arm is able to act as a pilot 
in directing„^fficient cleavage of an RNA target because such oligonucleotides are 
unable to direct efficient cleavage of DNA targets using native DNAPs. However, 
some 5' nucleases of the present invention (for example, clones E, F and G of Figure 
5 4) can cleave DNA in the absence of a 3' arm. In other words, a non-extendable 

cleavage structure is not required for specific cleavage with some 5' nucleases of the 
present invention derived from thermostable DNA polymerases. 

We tested whether cleavage of an RNA template by DNAPTaq in the presence 
of a fully complementary primer could help explain why DNAPTaq is unable to 
10 extend a DNA oligonucleotide on an RNA template, in a reaction resembling that of 

reverse transcriptase. Another thermophilic DNAP , DNAPTth, is able to use RNA as 
a template, but only in the presence of Mn++, so we predicted that this enzyme would 
not cleave RNA in the presence of this cation. Accordingly, we incubated an RNA 
molecule with an appropriate pilot oligonucleotide in the presence of DNAPTaq or 
15 DNAPTth, in buffer containing either Mg++ or Mn++. As expected, both enzymes 

cleaved the RNA in the presence of Mg++. However, DNAP7a<?, but not DNAPTth, 
degraded the RNA in the presence of Mn++. We conclude that the 5' nuclease 
activities of many DNAPs may contribute to their inability to use RNA as templates. 



EXAMPLE 2 

20 Generation Of 5' Nucleases From Thermostable DNA Polymerases 

Thermostable DNA polymerases were generated which have reduced syntheti 
activity, an activity that is an undesirable side-reaction during DNA cleavage in the 
detection assay of the invention, yet have maintained thermostable nuclease activity. 
The result is a thermostable polymerase which cleaves nucleic acids DNA with 
25 extreme specificity. 
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Type A DNA polymerases from eubacteria of the genus Thermus share 
extensive protein sequence identity (90% in the polymerization domain, using the 
Lipman-Pearson method in the DNA analysis software from DNAStar, WI) and behave 
similarly in both polymerization and nuclease assays. Therefore, we have used the 

5 genes for the DNA polymerase of Thermus aquaticus (DNAPTa?) and Thermus flavus 

(DNAPTfl) as representatives of this class. Polymerase genes from other eubacterial 
organisms, such as Thermus thermophilus, Thermus sp., Thermotoga maritima, 
Thermosipho africanus and Bacillus stearothermophilus are equally suitable. The 
DNA polymerases from these thermophilic organisms are capable of surviving and 

10 performing at elevated temperatures, and can thus be used in reactions in which 

temperature is used as a selection against non-specific hybridization of nucleic acid 
strands. 

The restriction sites used for deletion mutagenesis, described below, were 
chosen for convenience. Different sites situated with similar convenience are available 
in the Thermus thermophilus gene and can be used to make similar constructs with 
other Type A polymerase genes from related organisms. 



15 



A. Creation Of 5' Nuclease Constructs 
1. Modified DNAP Taq Genes 

The first step was to place a modified gene for the Taq DNA polymerase on a 
20 plasmid under control of an inducible promoter. The modified Taq polymerase gene 

was isolated as follows: The Taq DNA polymerase gene was amplified by polymerase 
chain reaction from genomic DNA from Thermus aquaticus, strain YT-1 (Lawyer et 
al, supra), using as primers the oligonucleotides described in SEQ ID NOS:13-14. 
The resulting fragment of DNA has a recognition sequence for the restriction 
endonuclease EcoRI at the 5' end of the coding sequence and a Bglll sequence at the 
3' end. Cleavage with Bglll leaves a 5' overhang or "sticky end" that is compatible 
with the end generated by BamHI. The PCR-amplified DNA was digested with EcoRI 



25 
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and BamHI. The 2512 bp fragment containing the coding region for the polymerase 
gene was gel purified and then ligated into a plasmid which contains an inducible 



In one embodiment of the invention, the pTTQ18 vector, which contains the 
hybrid trp-lac {tac) promoter, was used [M.J.R. Stark, Gene 5:255 (1987)] and shown 
in Figure 14. The tac promoter is under the control of the E. coli lac repressor. 
Repression allows the synthesis of the gene product to be suppressed until the desired 
level of bacterial growth has been achieved, at which point repression is removed by 
addition of a specific inducer, isopropyl-(3-D-thiogalactopyranoside (IPTG). Such a 
system allows the expression of foreign proteins that may slow or prevent growth of 
transformants. 

Bacterial promoters, such as tac, may not be adequately suppressed when they 
are present on a multiple copy plasmid. If a highly toxic protein is placed under 
control of such a promoter, the small amount of expression leaking through can be 
harmful to the bacteria. In another embodiment of the invention, another option for 
repressing synthesis of a cloned gene product was used. The non-bacterial promoter, 
from bacteriophage T7, found in the plasmid vector series pET-3 was used to express 
the cloned mutant Tag polymerase genes [Figure 15; Studier and Moffatt, J. Mol. Bio, 
189:113 (1986)]. This promoter initiates transcription only by T7 RNA polymerase. 
In a suitable strain, such as BL21(DE3)pLYS, the gene for this RNA polymerase is 
carried on the bacterial genome under control of the lac operator. This arrangement 
has the advantage that expression of the multiple copy gene (on the plasmid) is 
completely dependent on the expression of T7 RNA polymerase, which is easily 
suppressed because it is present in a single copy. 

For ligation into the pTTQ18 vector (Figure 14), the PCR product DNA 
containing the Taq polymerase coding region (xmxiTaq, clone 4B, SEQ ID NO:21) wa 
digested with EcoRI and Bglll and this fragment was ligated under standard "sticky 
end" conditions [Sambrook et al. Molecular Cloning, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, pp. 1.63-1.69 (1989)] into the EcoRI and BamHI sites of 



promoter. 
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the plasmid vector pTTQIS. Expression of this construct yields a translation^ fusion 
product in which the first two residues of the native protein (Met-Arg) are replaced by 
three from the vector (Met-Asn-Ser), but the remainder of the natural protein would 
not change. The construct was transformed into the JM109 strain of E. coli and the 
5 transformants were plated under incompletely repressing conditions that do not permit 

growth of bacteria expressing the native protein. These plating conditions allow the 
isolation of genes containing pre-existing mutations, such as those that result from the 
infidelity of Taq polymerase during the amplification process. 

Using this amplification/selection protocol, we isolated a clone (depicted in 
10 Figure 4B) containing a mutated Taq polymerase gene {vmxXTaq, clone 4B). The 

mutant was first detected by its phenotype, in which temperature-stable 5' nuclease 
activity in a crude cell extract was normal, but polymerization activity was almost 
absent (approximately less than 1% of wild type Taq polymerase activity). 

DNA sequence analysis of the recombinant gene showed that it had changes in 
15 the polymerase domain resulting in two amino acid substitutions: an A to G change at 

nucleotide position 1394 causes a Glu to Gly change at amino acid position 465 
(numbered according to the natural nucleic and amino acid sequences, SEQ ID NOS:l 
and 4) and another A to G change at nucleotide position 2260 causes a Gin to Arg 
change at amino acid position 754. Because the Gin to Gly mutation is at a 
20 nonconserved position and because the Glu to Arg mutation alters an amino acid that 

is conserved in virtually all of the known Type A polymerases, this latter mutation is 
most likely the one responsible for curtailing the synthesis activity of this protein. The 
nucleotide sequence for the Figure 4B construct is given in SEQ ID NO:21. The 
corresponding amino acid sequence encoded by the nucleotide sequence of SEQ ID 
25 NO:21 is listed in SEQ ID NO: 85. 

Subsequent derivatives of DNAPTaq constructs were made from the mutTaq 
gene, thus, they all bear these amino acid substitutions in addition to their other 
alterations, unless these particular regions were deleted. These mutated sites are 
indicated by black boxes at these locations in the diagrams in Figure 4. In Figure 4, 
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the designation "3' Exo" is used to indicate the location of the 3' exonuclease activity 
associated with Type A polymerases which is not present in DNAPTag. All constructs 
except the genes shown in Figures 4E, F and G were made in the pTTQ18 vector. 

The cloning vector used for the genes in Figures 4E and F was from the 
commercially available pET-3 series, described above. Though this vector series has 
only a BamHI site for cloning downstream of the T7 promoter, the series contains 
variants that allow cloning into any of the three reading frames. For cloning of the 
PCR product described above, the variant called pET-3c was used (Figure 15). The 
vector was digested with BamHI, dephosphorylated with calf intestinal phosphatase, 
and the sticky ends were filled in using the Klenow fragment of DNAPEcl and 
dNTPs. The gene for the mutant Taq DNAP shown in Figure 4B (mxxiTaq, clone 4B) 
was released from pTTQ18 by digestion with EcoRI and Sail, and the "sticky ends" 
were filled in as was done with the vector. The fragment was ligated to the vector 
under standard blunt-end conditions (Sambrook et al, Molecular Cloning, supra), the 
construct was transformed into the BL21(DE3)pLYS strain of E. coli, and isolates 
were screened to identify those that were ligated with the gene in the proper 
orientation relative to the promoter. This construction yields another translational 
fusion product, in which the first two amino acids of DNAPIiag (Met-Arg) are 
replaced by 13 from the vector plus two from the PCR primer (Met-Ala-Ser-Met-Thr- 
Gly-Gly-Gln-Gln-Met-Gly-Arg-Ile-Asn-Ser) (SEQ ID NO:29). 

Our goal was to generate enzymes that lacked the ability to synthesize DNA, 
but retained the ability to cleave nucleic acids with a 5' nuclease activity. The act of 
primed, templated synthesis of DNA is actually a coordinated series of events, so it is 
possible to disable DNA synthesis by disrupting one event while not affecting the 
others. These steps include, but are not limited to, primer recognition and binding, 
dNTP binding and catalysis of the inter-nucleotide phosphodiester bond. Some of the 
amino acids in the polymerization domain of DNAPEcl have been linked to these 
functions, but the precise mechanisms are as yet poorly defined. 



- 116 - 




10 



15 



20 



25 



One way of destroying the polymerizing ability of a DNA polymerase is to 
delete all or part of the gene segment that encodes that domain for the protein, or to 
otherwise render the gene incapable of making a complete polymerization domain. 
Individual mutant enzymes may differ from each other in stability and solubility both 
inside and outside cells. For instance, in contrast to the 5' nuclease domain of 
DNAPEcI, which can be released in an active form from the polymerization domain 
by gentle proteolysis [Setlow and Kornberg, J. Biol. Chem. 247:232 (1972)], the 
Thermus nuclease domain, when treated similarly, becomes less soluble and the 

cleavage activity is often lost. 

Using the mutant gene shown in Figure 4B as starting material, several deletion 
constructs were created. All cloning technologies were standard (Sambrook et al, 
supra) and are summarized briefly, as follows: 

Figure 4C: The mvXTaq construct was digested with PstI, which cuts once 
within the polymerase coding region, as indicated, and cuts immediately downstream 
of the gene in the multiple cloning site of the vector. After release of the fragment 
between these two sites, the vector was re-ligated, creating an 894-nucleotide deletion, 
and bringing into frame a stop codon 40 nucleotides downstream of the junction. The 
nucleotide sequence of this 5' nuclease (clone 4C) is given in SEQ ID NO:9. The 
corresponding ammo acid sequence encoded by the nucleotide sequence of SEQ ID 
NO:9 is listed in SEQ ID NO:86. 

Figure 4D: The mx&Taq construct was digested with Nhel, which cuts once in 
the gene at position 2047. The resulting four-nucleotide 5' overhanging ends were 
filled in, as described above, and the blunt ends were re-ligated. The resulting four- 
nucleotide insertion changes the reading frame and causes termination of translation 
ten amino acids downstream of the mutation. The nucleotide sequence of this 5' 
nuclease (clone 4D) is given in SEQ ID NO:10. The corresponding amino acid 
sequence encoded by the nucleotide sequence of SEQ ID NO:10 is listed in SEQ ID 
NO:87. 
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Figure 4E: The entire mvfiTaq gene was cut from pTTQIS using EcoRI and 
Sail and cloned into pET-3c, as described above. This clone was digested with BstXI 
and Xcml, at unique sites that are situated as shown in Figure 4E. The DNA was 
treated with the Klenow fragment of DNAPEcl and dNTPs, which resulted in the 3' 
overhangs of both sites being trimmed to blunt ends. These blunt ends were ligated 
together, resulting in an out-of-frame deletion of 1540 nucleotides. An in-frame 
termination codon occurs 18 triplets past the junction site. The nucleotide sequence of 
this 5' nuclease (clone 4E) is given in SEQ ID NO:ll [The corresponding amino acid 
sequence encoded by the nucleotide sequence of SEQ ID NO: 11 is listed in SEQ ID 
NO:88]., with the appropriate leader sequence given in SEQ ID NO:30 [The 
corresponding amino acid sequence encoded by the nucleotide sequence of SEQ ID 
NO:30 is listed in SEQ ID NO:89.. It is also referred to as the Cleavase™ BX. 
enzyme. 

Figure 4F: The entire mutTaq gene was cut from pTTQIS using EcoRI and 
Sail and cloned into pET-3c, as described above. This clone was digested with BstXI 
and BamHI, at unique sites that are situated as shown in the diagram. The DNA was 
treated with the Klenow fragment of DNAPEcl and dNTPs, which resulted in the 3' 
overhang of the BstX I site being trimmed to a blunt end, while the 5' overhang of the 
Bam HI site was filled in to make a blunt end. These ends were ligated together, 
resulting in an in-frame deletion of 903 nucleotides. The nucleotide sequence of the 5' 
nuclease (clone 4F) is given in SEQ ID NO:12. It is also referred to as the 
Cleavase™ BB enzyme. The corresponding amino acid sequence encoded by the 
nucleotide sequence of SEQ ID NO:12 is listed in SEQ ID NO:90. 

Figure 4G: This polymerase is a variant of that shown in Figure 4E. It was 
cloned in the plasmid vector pET-21 (Novagen). The non-bacterial promoter from 
bacteriophage T7, found in this vector, initiates transcription only by T7 RNA 
polymerase. See Studier and Moffatt, supra. In a suitable strain, such as (DES)pLYS, 
the gene for this RNA polymerase is carried on the bacterial genome under control of 
the lac operator. This arrangement has the advantage that expression of the multiple 
copy gene (on the plasmid) is completely dependent on the expression of T7 RNA 
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polymerase, which is easily suppressed because it is present in a single copy. Because 
the expression of these mutant genes is under this tightly controlled promoter, potential 
problems of toxicity of the expressed proteins to the host cells are less of a concern. 
The pET-21 vector also features a "His-Tag", a stretch of six consecutive 
5 histidine residues that are added on the carboxy terminus of the expressed proteins. 

The resulting proteins can then be purified in a single step by metal chelation 
chromatography, using a commercially available (Novagen) column resin with 
immobilized Ni^ ions. The 2.5 ml columns are reusable, and can bind up to 20 mg of 
the target protein under native or denaturing (guanidine-HCl or urea) conditions. 
10 E coU (DES)pLYS cells are transformed with the constructs described above 

using standard transformation techniques, and used to inoculate a standard growth 
medium {e.g., Luria-Bertani broth). Production of T7 RNA polymerase is induced 
during log phase growth by addition of IPTG and incubated for a further 12 to 17 
hours. Aliquots of culture are removed both before and after induction and the 
15 proteins are examined by SDS-PAGE. Staining with Coomassie Blue allows 

visualization of the foreign proteins if they account for about 3-5% of the cellular 
protein and do not co-migrate with any of the major host protein bands. Proteins that 
co-migrate with major host proteins must be expressed as more than 10% of the total 
protein to be seen at this stage of analysis. 
20 Some mutant proteins are sequestered by the cells into inclusion bodies. These 

are granules that form in the cytoplasm when bacteria are made to express high levels 
of a foreign protein, and they can be purified from a crude lysate, and analyzed by 
SDS-PAGE to determine their protein content. If the cloned protein is found in the 
inclusion bodies, it must be released to assay the cleavage and polymerase activities. 
25 Different methods of solubilization may be appropriate for different proteins, and a 

variety of methods are known. See e.g., Builder & Ogez, U.S. Patent No. 4,51 1,502 
(1985); Olson, U.S. Patent No. 4,518,526 (1985); Olson & Pai, U.S. Patent No. 
4,511,503 (1985); Jones et al, U.S. Patent No. 4,512,922 (1985), all of which are 
hereby incorporated by reference. 
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The solubilized protein is then purified on the Ni^ column as described above, 
following the manufacturers instructions (Novagen). The washed proteins are eluted 
from the column by a combination of imidazole competitor (1 M) and high salt (0.5 M 
NaCl), and dialyzed to exchange the buffer and to allow denatured proteins to refold. 
5 Typical recoveries result in approximately 20 ug of specific protein per ml of starting 

culture. The DNAP mutant is referred to as the Cleavase™ BN enzyme and the 
sequence is given in SEQ ID NO:31. The corresponding amino acid sequence encoded 
by the nucleotide sequence of SEQ ID NO:31 is listed in SEQ ID NO:91. 

2. Modified DNAPTfl Gene 

10 The DNA polymerase gene of Thermus flavus was isolated from the "T. flavus" 

AT-62 strain obtained from the American Type Tissue Collection (ATCC 33923). 
This strain has a different restriction map then does the T. flavus strain used to 
generate the sequence published by Akhmetzjanov and Vakhitov, supra. The 
published sequence is listed as SEQ ID NO:2. No sequence data has been published 
15 for the DNA polymerase gene from the AT-62 strain of T. flavus. 

Genomic DNA from T. flavus was amplified using the same primers used to 
amplify the T. aquaticus DNA polymerase gene (SEQ ID NOS: 13-14). The 
approximately 2500 base pair PCR fragment was digested with EcoRI and BamHI. 
The over-hanging ends were made blunt with the Klenow fragment of DNAPEcl and 
20 dNTPs. The resulting approximately 1800 base pair fragment containing the coding 

region for the N-terminus was ligated into pET-3c, as described above. This construct, 
clone 5B, is depicted in Figure 5B. The wild type T. flavus DNA polymerase gene is 
depicted in Figure 5A. In Figure 5, the designation "3' Exo" is used to indicate the 
location of the 3' exonuclease activity associated with Type A polymerases which is 
25 not present in DNAPTfl. The 5B clone has the same leader amino acids as do the 

DNAPTaq clones 4E and F which were cloned into pET-3c; it is not known precisely 
where translation termination occurs, but the vector has a strong transcription 
termination signal immediately downstream of the cloning site. 
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B. Growth And Induction Of Transformed Cells 

Bacterial cells were transformed with the constructs described above using 
standard transformation techniques and used to inoculate 2 mis of a standard growth 
medium {e.g., Luria-Bertani broth). The resulting cultures were incubated as 
appropriate for the particular strain used, and induced if required for a particular 
expression system. For all of the constructs depicted in Figures 4 and 5, the cultures 
were grown to an optical density (at 600nm wavelength) of 0.5 OD. 

To induce expression of the cloned genes, the cultures were brought to a final 
concentration of 0.4 raM IPTG and the incubations were continued for 12 to 17 hours. 
50 ix\ aliquots of each culture were removed both before and after induction and were 
combined with 20 M l of a standard gel loading buffer for sodium dodecyl sulfate- 
polyacrylamide gel electrophoresis (SDS-PAGE). Subsequent staining with Coomassie 
Blue (Sambrook et ah, supra) allows visualization of the foreign proteins if they 
account for about 3-5% of the cellular protein and do not co-migrate with any of the 
major E. coli protein bands. Proteins that do co-migrate with a major host protein 
must be expressed as more than 10% of the total protein to be seen at this stage of 
analysis. 
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C. Heat Lysis And Fractionation 

Expressed thermostable proteins, i.e., the 5' nucleases, were isolated by heating 
crude bacterial cell extracts to cause denaturation and precipitation of the less stable E 
coli proteins. The precipitated E. coli proteins were then, along with other cell debris, 
removed by centrifugation. 1.7 mis of the culture were pelleted by microcentnfugation 
at 12,000 to 14,000 rpm for 30 to 60 seconds. After removal of the supernatant, the 
cells were resuspended in 400 pi of buffer A (50 mM Tris-HCl, pH 7.9, 50 mM 
dextrose, 1 mM EDTA), re-centrifuged, then resuspended in 80 fi\ of buffer A with 
4mg/ml lysozyme. The cells were incubated at room temperature for 15 minutes, then 
combined with 80 »\ of buffer B (10 mM Tris-HCl, pH 7.9, 50 mM KC1, 1 mM 
EDTA, 1 mM PMSF, 0.5% Tween-20, 0.5% Nonidet-P40). 
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This mixture was incubated at 75°C for 1 hour to denature and precipitate the 
host proteins. This cell extract was centnfuged at 14,000 rpm for 15 minutes at 4°C, 
and the supernatant was transferred to a fresh tube. An aliquot of 0.5 to 1 p\ of this 
supernatant was used directly in each test reaction, and the protein content of the 
extract was determined by subjecting 7 M l to electrophoretic analysis, as above. The 
native recombinant Tag DNA polymerase [Englke, Anal. Biochem 191:396 (1990)], 
and the double point mutation protein shown in Figure 4B are both soluble and active 
at this point. 

The foreign protein may not be detected after the heat treatments due to 
sequestration of the foreign protein by the cells into inclusion bodies. These are 
granules that form in the cytoplasm when bacteria are made to express high levels of a 
foreign protein, and they can be purified from a crude lysate, and analyzed SDS PAGE 
to determine their protein content. Many methods have been described in the 
literature, and one approach is described below. 
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D. Isolation And Solubilization Of Inclusion Bodies 

A small culture was grown and induced as described above. A 1.7 ml aliquot 
was pelleted by brief centrifugation, and the bacterial cells were resuspended in 100 pi 
of Lysis buffer (50 mM Tris-HCl, pH 8.0, 1 mM EDTA, 100 mM NaCl). 2.5 M l of 
20 mM PMSF were added for a final concentration of 0.5 mM, and lysozyme was 
added to a concentration of 1.0 mg/ml. The cells were incubated at room temperature 
for 20 minutes, deoxycholic acid was added to lmg/ml (1 M l of 100 mg/ml solution), 
and the mixture was further incubated at 37°C for about 15 minutes or until viscous. 
DNAse I was added to 10 M g/ml and the mixture was incubated at room temperature 
for about 30 minutes or until it was no longer viscous. 

From this mixture the inclusion bodies were collected by centrifugation at 
14,000 rpm for 15 minutes at 4°C, and the supernatant was discarded. The pellet was 
resuspended in 100 M l of lysis buffer with lOmM EDTA (pH 8.0) and 0.5% Triton X- 
100. After 5 minutes at room temperature, the inclusion bodies were pelleted as 
before, and the supernatant was saved for later analysis. The inclusion bodies were 
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resuspended in 50 „1 of distilled water, and 5 M l was combined with SDS gel loading 
buffer (which dissolves the inclusion bodies) and analyzed electrophoretically, along 
with an aliquot of the supernatant. 

If the cloned protein is found in the inclusion bodies, it may be released to 
5 assay the cleavage and polymerase activities and the method of solubilization must be 

compatible with the particular activity. Different methods of solubilization may be 
appropriate for different proteins, and a variety of methods are discussed in Molecular 
Cloning (Sambrook et al, supra). The following is an adaptation we have used for 
several of our isolates. 

10 20 Ml of the inclusion body-water suspension were pelleted by centrifugation at 

14,000 rpm for 4 minutes at room temperature, and the supernatant was discarded. To 
further wash the inclusion bodies, the pellet was resuspended in 20^1 of lysis buffer 
with 2M urea, and incubated at room temperature for one hour. The washed inclusion 
bodies were then resuspended in 2 M l of lysis buffer with 8M urea; the solution 
15 clarified visibly as the inclusion bodies dissolved. Undissolved debris was removed by 

centrifugation at 14,000 rpm for 4 minutes at room temperature, and the extract 
supernatant was transferred to a fresh tube. 

To reduce the urea concentration, the extract was diluted into KH 2 P0 4 . A fresh 
tube was prepared containing 180 M l of 50 mM KH 2 P0 4 , P H 9.5, 1 mM EDTA and 50 
20 mM NaCl. A 2 p\ aliquot of the extract was added and vortexed briefly to mix. This 

step was repeated until all of the extract had been added for a total of 10 additions. 
The mixture was allowed to sit at room temperature for 15 minutes, during which time 
some precipitate often forms. Precipitates were removed by centrifugation at 14,000 
rpm for 15 minutes at room temperature, and the supernatant was transferred to a 
25 fresh tube. To the 200 M l of protein in the KH 2 P0 4 solution, 140-200 ,1 of saturated 

(NH 4 ) 2 S0 4 were added, so that the resulting mixture was about 41% to 50% saturated 
(NH 4 ) 2 SO, The mixture was chilled on ice for 30 minutes to allow the protein to 
precipitate, and the protein was then collected by centrifugation at 14,000 rpm, for 4 
minutes at room temperature. The supernatant was discarded, and the pellet was 
30 dissolved in 20 ,X Buffer C (20 mM HEPES, pH 7.9, 1 mM EDTA, 0.5% PMSF, 25 
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mM KC1 and 0.5 % each of Tween-20 and Nonidet P 40). The protein solution was 
centrifuged again for 4 minutes to pellet insoluble materials, and the supernatant was 
removed to a fresh tube. The protein contents of extracts prepared in this manner 
were visualized by resolving 1-4 ,1 by SDS-PAGE; 0.5 to 1 ,1 of extract was tested in 
the cleavage and polymerization assays as described. 

E. Protein Analysis For Presence Of Nuclease And 
Synthetic Activity 

The 5' nucleases described above and shown in Figures 4 and 5 were analyzed 
by the following methods. 

1. Structure Specific Nuclease Assay 

A candidate modified polymerase is tested for 5' nuclease activity by 
examining its ability to catalyze structure-specific cleavages. By the term "cleavage 
structure" as used herein, is meant a nucleic acid structure which is a substrate for 
cleavage by the 5' nuclease activity of a DNAP. 

The polymerase is exposed to test complexes that have the structures shown in 
Figure 16. Testing for 5' nuclease activity involves three reactions: 1) a primer- 
directed cleavage (Figure 16B) is performed because it is relatively insensitive to 
variations in the salt concentration of the reaction and can, therefore, be performed m 
whatever solute conditions the modified enzyme requires for activity; this is generally 
the same conditions preferred by unmodified polymerases; 2) a similar primer-directed 
cleavage is performed in a buffer which permits primer-independent cleavage, i.e., a 
low salt buffer, to demonstrate that the enzyme is viable under these conditions; and 3) 
a primer-independent cleavage (Figure 16A) is performed in the same low salt buffer. 

The bifurcated duplex is formed between a substrate strand and a template 
strand as shown in Figure 16. By the term "substrate strand" as used herein, is meant 
that strand of nucleic acid in which the cleavage mediated by the 5' nuclease activity 
occurs. The substrate strand is always depicted as the top strand in the bifurcated 
complex which serves as a substrate for 5' nuclease cleavage (Figure 16). By the term 
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"template strand" as used herein, is meant the strand of nucleic acid which ,s at least 
partially complementary to the substrate strand and which anneals to the substrate 
strand to form the cleavage structure. The template strand is always depicted as the 
bottom strand of the bifurcated cleavage structure (figure 16). If a primer (a short 
5 ohgonuclcotide of 19 to 30 nucleotides in length) is added to the complex, as when 

primer-dependent cleavage ,s to be tested, it ,s designed to anneal to the 3' am. of the 
template strand (Figure 16B). Such a primer would be extended along the template 
strand if the polymerase used in the reaction has synthetic activtty. 

The cleavage structure may be made as a single hairpin molecule, wtth the 3' 
10 end of the targe, and the 5' end of the pilot joined as a loop as shown in figure 16E. 
A primer oi.gonucleotide complementary to the 3' arm is also requtred for these tests 
so that the enzyme's sensitivity to the presence of a primer may be tested. 

Nucleic acids to be used to form test cleavage structures can be chemically 
synthesized, or can be generated by standard recombinant DNA techniques. By the 
IS latter method, the hairpin portion of the molecule can be created by inserting into a 

cloning vector duplicate copies of a short DNA segment, adjacent to each other but m 
opposing orientation. The double-stranded fragment encompassing this inverted repeat, 
and including enough flaring sequence to give short (about 20 nucleotides) unpaired 
5- and 3' arms, can then be released from the vector by restriction enzyme d.gestion, 
20 or by PCR performed with an enzyme lacking a 5' exonuclease (e.g. , the Stoffel 
fragment of Amplitaq™ DNA polymerase, Vent™ DNA polymerase). 

The test DNA can be labeled on either end, or internally, with either a 
rad,o,so.ope, or with a „on-,sotopic tag. Whether the hairpin DNA is a synthetic 
single strand or a cloned double strand, the DNA is heated prior to use to melt all 
25 duplexes. When cooied on ice, the structure depicted in Figure 16E is formed, and is 

stable for sufficient time to perform these assays. 

To test for primer-directed cleavage (Reaction 1), a detectable quantity of the 
test molecule (typically 1-100 fmo. of "P-labeled hairpin molecule) and a 10 to 100- 
fold molar excess of primer are placed in a buffer known to be compatible with the 
30 test enzyme. For Reaction 2, where primer-directed cleavage is performed under 
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condition which allow primer-independent cleavage, the same quantities of molecules 
are placed in a solution that is the same as the buffer used in Reaction 1 regardmg pH, 
enzyme stabilizers (e.g., bovine serum albumin, nonionic detergents, gelatin) and 
reducing agents (e.g., dithiothreitol, 2-mercaptoethanol) but that replaces any 
monovalent cation salt with 20 mM KC1; 20 mM KC1 is the demonstrated opttmum for 
primer-independent cleavage. Buffers for enzymes, such as DNAPEcl, that usually 
operate in the absence of salt are not supplemented to ach.eve this concentrate. To 
test for primer-independent cleavage (Reaction 3) the same quantity of the test 
molecule, but no primer, are combined under the same buffer condittons used for 
R.C£tction 2 

All three test reactions are then exposed to enough of the enzyme that the 
molar ratio of enzyme to test complex is approximately 1:1. The reactions are 
incubated at a range of temperatures up to, but not exceeding, the temperature allowed 
by either the enzyme stability or the complex stability, whichever is lower, up to 80°C 
for enzymes from thermophiles, for a time sufficient to allow cleavage (10 to 60 
minutes). The products of Reactions 1, 2 and 3 are resolved by denaturing 
polyacrylamide gel electrophoresis, and visualized by autoradiography or by a 
comparable method appropriate to the labeling system used. Additional labeling 
systems include chemiluminescence detection, silver or other stains, blotting and 
probing and the like. The presence of cleavage products is indicated by the presence 
of molecules which migrate at a lower molecular weight than does the uncleaved test 
structure. These cleavage products indicate that the candidate polymerase has 
structure-specific 5' nuclease activity. 

To determine whether a modified DNA polymerase has substantially the same 
5' nuclease activity as that of the native DNA polymerase, the results of the above- 
described tests are compared with the results obtained from these tests performed with 
the native DNA polymerase. By "substantially the same 5' nuclease activity" we mean 
that the modified polymerase and the native polymerase will both cleave test molecules 
in the same manner. It is not necessary that the modified polymerase cleave at the 
same rate as the native DNA polymerase. 
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Some enzymes or enzyme preparations may have other associated or 
contaminating activities that may be functional under the cleavage conditions described 
above and that may interfere with 5' nuclease detection. Reaction conditions can be 
modified in consideration of these other activities, to avoid destruction of the substrate, 
5 or other masking of the 5' nuclease cleavage and its products. For example, the DNA 

polymerase I of E. coli (Pol I), in addition to its polymerase and 5' nuclease activities, 
has a 3' exonuclease that can degrade DNA in a 3' to 5' direction. Consequently, 
when the molecule in Figure 16E is exposed to this polymerase under the conditions 
described above, the 3' exonuclease quickly removes the unpaired 3' arm, destroying 
10 the bifurcated structure required of a substrate for the 5' exonuclease cleavage and no 

cleavage is detected. The true ability of Pol I to cleave the structure can be revealed if 
the 3' exonuclease is inhibited by a change of conditions {e.g., pH), mutation, or by 
addition of a competitor for the activity. Addition of 500 pmoles of a single- stranded 
competitor oligonucleotide, unrelated to the Figure 16E structure, to the cleavage 
15 reaction with Pol I effectively inhibits the digestion of the 3' arm of the Figure 16E 

structure without interfering with the 5' exonuclease release of the 5' arm. The 
concentration of the competitor is not critical, but should be high enough to occupy the 
3' exonuclease for the duration of the reaction. 

Similar destruction of the test molecule may be caused by contaminants in the 
20 candidate polymerase preparation. Several sets of the structure specific nuclease 

reactions may be performed to determine the purity of the candidate nuclease and to 
find the window between under and over exposure of the test molecule to the 
polymerase preparation being investigated. 

The above described modified polymerases were tested for 5' nuclease activity 
25 as follows: Reaction 1 was performed in a buffer of 10 mM Tris-Cl, pH 8.5 at 20°C, 

1.5 mM MgCl 2 and 50 mM KC1 and in Reaction 2 the KC1 concentration was reduced 
to 20 mM. In Reactions 1 and 2, 10 fmoles of the test substrate molecule shown in 
Figure 16E were combined with 1 pmole of the indicated primer and 0.5 to 1.0 uf of 
extract containing the modified polymerase (prepared as described above). This 
30 mixture was then incubated for 10 minutes at 55°C. For all of the mutant polymerases 
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tested these conditions were sufficient to give complete cleavage. When the molecule 
shown in Figure 16E was labeled at the 5' end, the released 5' fragment, 25 
nucleotides long, was conveniently resolved on a 20% polyacrylamide gel (19:1 cross- 
linked) with 7 M urea in a buffer containing 45 mM Tris-borate pH 8.3, 1.4 mM 
EDTA. Clones 4C-F and 5B exhibited structure-specific cleavage comparable to that 
of the unmodified DNA polymerase. Additionally, clones 4E, 4F and 4G have the 
added ability to cleave DNA in the absence of a 3' arm as discussed above. 
Representative cleavage reactions are shown in Figure 17. 

For the reactions shown in Figure 17, the mutant polymerase clones 4E {Tag 
mutant) and 5B (Tfl mutant) were examined for their ability to cleave the hairpin 
substrate molecule shown in Figure 16E. The substrate molecule was labeled at the 5' 
terminus with 32 P. Ten fmoles of heat-denatured, end-labeled substrate DNA and 0.5 
units of DNAP7a<7 (lane 1) or 0.5 ul of 4e or 5b extract (Figure 17, lanes 2-7, extract 
was prepared as described above) were mixed together in a buffer containing 10 mM 
15 Tris-Cl, pH 8.5, 50 mM KC1 and 1.5 mM MgCl 2 . The final reaction volume was 

10 ul. Reactions shown in lanes 4 and 7 contain in addition 50 uM of each dNTP. 
Reactions shown m lanes 3, 4, 6 and 7 contain 0.2 uM of the primer oligonucleotide 
(complementary to the 3' arm of the substrate and shown in Figure 16E). Reactions 
were incubated at 55° C for 4 minutes. Reactions were stopped by the addition of 8 
Hi of 95% formamide containing 20 mM EDTA and 0.05% marker dyes per 10 ul 
reaction volume. Samples were then applied to 12% denaturing acrylamide gels. 
Following electrophoresis, the gels were autoradiographed. Figure 17 shows that 
clones 4E and 5B exhibit cleavage activity similar to that of the native DNAPTaq. 
Note that some cleavage occurs in these reactions in the absence of the primer. When 
25 long hairpin structure, such as the one used here (Figure 16E), are used in cleavage 

reactions performed in buffers containing 50 mM KC1 a low level of primer- 
independent cleavage is seen. Higher concentrations of KC1 suppress, but do not 
eliminate, this primer-independent cleavage under these conditions. 



20 
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2. Assay For Synthetic Activity 

The ability of the modified enzyme or proteolytic fragments is assayed by 
adding the modified enzyme to an assay system in which a primer is annealed to a 
template and DNA synthesis is catalyzed by the added enzyme. Many standard 
5 laboratory techniques employ such an assay. For example, nick translation and 

enzymatic sequencing involve extension of a primer along a DNA template by a 

polymerase molecule. 

In a preferred assay for determining the synthetic activity of a modified enzyme 
an oligonucleotide primer is annealed to a single- stranded DNA template, e.g., 
10 bacteriophage M13 DNA, and the primer/template duplex is incubated in the presence 

of the modified polymerase in question, deoxynucleoside triphosphates (dNTPs) and 
the buffer and salts known to be appropriate for the unmodified or native enzyme. 
Detection of either primer extension (by denaturing gel electrophoresis) or dNTP 
incorporation (by acid precipitation or chromatography) is indicative of an active 
15 polymerase. A label, either isotopic or non-isotopic, is preferably included on either 

the primer or as a dNTP to facilitate detection of polymerization products. Synthetic 
activity is quantified as the amount of free nucleotide incorporated into the growing 
DNA chain and is expressed as amount incorporated per unit of time under specific 
reaction conditions. 

20 Representative results of an assay for synthetic activity is shown in Figure 18. 

The synthetic activity of the mutant WPXTaq clones 4B-F was tested as follows: A 
master mixture of the following buffer was made: 1.2X PCR buffer (IX PCR buffer 
contains 50 mM KC1, 1.5 mM MgCl 2 , 10 mM Tris-Cl, ph 8.5 and 0.05% each Tween 
20 and Nonidet P40), 50 each of dGTP, dATP and dTTP, 5 uM dCTP and 0.125 

25 uM a- 32 P-dCTP at 600 Ci/mmol. Before adjusting this mixture to its final volume, it 

was divided into two equal aliquots. One received distilled water up to a volume of 
50 nl to give the concentrations above. The other received 5 Hg of single-stranded 
M13mpl8 DNA (approximately 2.5 pmol or 0.05 uM final concentration) and 250 
pmol of M13 sequencing primer (5 uM final concentration) and distilled water to a 
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final volume of 50 jxl. Each cocktail was warmed to 75°C for 5 minutes and then 
cooled to room temperature. This allowed the primers to anneal to the DNA in the 

DNA-containing mixtures. 

For each assay, 4 ul of the cocktail with the DNA was combined with 1 ul of 
the mutant polymerase, prepared as described, or 1 unit of DNAPTag (Perkm Elmer) 
in 1 ul of dH 2 0. A "no DNA" control was done in the presence of the DNAPTag 
(Figure 18, lane 1), and a "no enzyme" control was done using water in place of the 
enzyme (lane 2). Each reaction was mixed, then incubated at room temperature 
(approx. 22°Q for 5 minutes, then at 55°C for 2 minutes, then at 72°C for 2 minutes. 
This step incubation was done to detect polymerization in any mutants that might have 
optimal temperatures lower than 72°C. After the final incubation, the tubes were spun 
briefly to collect any condensation and were placed on ice. One ul of each reaction 
was spotted at an origin 1.5 cm from the bottom edge of a polyethyleneimme (PEI) 
cellulose thin layer chromatography plate and allowed to dry. The chromatography 
plate was run in 0.75 M NaH 2 P0 4 , pH 3.5, until the buffer front had run 
approximately 9 cm from the origin. The plate was dried, wrapped m plastic wrap, 
marked with luminescent ink, and exposed to X-ray film. Incorporation was detected 
as counts that stuck where originally spotted, while the unincorporated nucleotides 
were carried by the salt solution from the origin. 

Comparison of the locations of the counts with the two control lanes confirmed 
the lack of polymerization activity in the mutant preparations. Among the modified 
DNAPTag clones, only clone 4B retains any residual synthetic activity as shown in 
Figure 18. 
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EXAMPLE 3 

5' Nucleases Derived From Thermostable DNA 
Polymerases Can Cleave Short Hairpin Structures With Specificity 

The ability of the 5' nucleases to cleave hairpin structures to generate a cleaved 
5 hairpin structure suitable as a detection molecule was examined. The structure and 

sequence of the hairpin test molecule is shown in Figure 19A (SEQ ID NO:15). The 
oligonucleotide (labeled "primer" in Figure 19A, SEQ ID NO:22) is shown annealed to 
its complementary sequence on the 3' arm of the hairpm test molecule. The hairpin 
test molecule was single-end labeled with 32 P using a labeled T7 promoter primer in a 
10 polymerase chain reaction. The label is present on the 5' arm of the hairpin test 

molecule and is represented by the star in Figure 19 A. 

The cleavage reaction was performed by adding 10 fmoles of heat-denatured, 
end-labeled hairpin test molecule, 0.2uM of the primer oligonucleotide (complementary 
to the 3' arm of the hairpin), 50 uM of each dNTP and 0.5 units of VKAPTaq (Perkin 
15 Elmer) or 0.5 ul of extract containing a 5' nuclease (prepared as described above) in a 

total volume of 10 ^1 in a buffer containing 10 raM Tris-Cl, pH 8.5, 50 mM KC1 and 
1.5 mM MgCl 2 . Reactions shown in lanes 3, 5 and 7 were run in the absence of 
dNTPs. 

Reactions were incubated at 55° C for 4 minutes. Reactions were stopped at 
20 55° C by the addition of 8 ul of 95% formamide with 20 mM EDTA and 0.05% 

marker dyes per 10 ul reaction volume. Samples were not heated before loading onto 
denaturing polyacrylamide gels (10% polyacrylamide, 19:1 crosslinking, 7 M urea, 89 
mM Tris-borate, pH 8.3, 2.8 mM EDTA). The samples were not heated to allow for 
the resolution of single-stranded and re-duplexed uncleaved hairpin molecules. 
25 Figure 19B shows that altered polymerases lacking any detectable synthetic 

activity cleave a hairpin structure when an oligonucleotide is annealed to the single- 
stranded 3' arm of the hairpin to yield a single species of cleaved product (Figure 19B, 
lanes 3 and 4). 5' nucleases, such as clone 4D, shown in lanes 3 and 4, produce a 
single cleaved product even in the presence of dNTPs. 5' nucleases which retain a 
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residual amount of synthetic activity (less than 1% of wild type activity) produce 
multiple cleavage products as the polymerase can extend the oligonucleotide annealed 
to the 3' arm of the hairpin thereby moving the site of cleavage (clone 4B, lanes 5 and 
6). Native UNATaq produces even more species of cleavage products than do mutant 
polymerases retaining residual synthetic activity and additionally converts the hairpin 
structure to a double- stranded form in the presence of dNTPs due to the high level of 
synthetic activity in the native polymerase (Figure 19B, lane 8). 

EXAMPLE 4 

Test Of The Trigger/Detection Assay 

To test the ability of an oligonucleotide of the type released in the trigger 
reaction of the trigger/detection assay to be detected in the detection reaction of the 
assay, the two hairpin structures shown in Figure 20A were synthesized using standard 
techniques. The two hairpins are termed the A-hairpin (SEQ ID NO:23) and the T- 
hairpin (SEQ ID NO:24). The predicted sites of cleavage in the presence of the 
appropriate annealed primers are indicated by the arrows. The A- and T-hairpins were 
designed to prevent intra-strand mis-folding by omitting most of the T residues in the 
A-hairpin and omitting most of the A residues in the T-hairpin. To avoid mis-priming 
and slippage, the hairpins were designed with local variations in the sequence motifs 
(e.g., spacing T residues one or two nucleotides apart or in pairs). The A- and T- 
20 hairpins can be annealed together to form a duplex which has appropriate ends for 

directional cloning in pUC-type vectors; restriction sites are located in the loop regions 
of the duplex and can be used to elongate the stem regions if desired. 

The sequence of the test trigger oligonucleotide is shown in Figure 20B; this 
oligonucleotide is termed the alpha primer (SEQ ID NO:25). The alpha primer is 
25 complementary to the 3' arm of the T-hairpin as shown in Figure 20A. When the 

alpha primer is annealed to the T-hairpin, a cleavage structure is formed that is 
recognized by thermostable DNA polymerases. Cleavage of the T-hairpin liberates the 
5' single-stranded arm of the T-hairpin, generating the tau primer (SEQ ID NO:26) 
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and a cleaved T-hairpin (Figure 20B; SEQ ID NO:27). The tau primer is 
complementary to the 3' arm of the A-hairpin as shown in Figure 20A. Annealing of 
the tau primer to the A-hairpin generates another cleavage structure; cleavage of this 
second cleavage structure liberates the 5' single- stranded arm of the A-hairpin, 
5 generating another molecule of the alpha primer which then is annealed to another 

molecule of the T-hairpin. Thermocycling releases the primers so they can function in 
additional cleavage reactions. Multiple cycles of annealing and cleavage are carried 
out. The products of the cleavage reactions are primers and the shortened hairpin 
structures shown in Figure 20C. The shortened or cleaved hairpin structures may be 
10 resolved from the uncleaved hairpins by electrophoresis on denaturing acrylamide gels. 

The annealing and cleavage reactions are carried as follows: In a 50 ul 
reaction volume containing 10 mM Tris-Cl, pH 8.5, 1.0 MgCl 2 , 75 mM KC1, 1 pmole 
of A-hairpin, 1 pmole T-hairpin, the alpha primer is added at equimolar amount 
relative to the hairpin structures (1 pmole) or at dilutions ranging from 10- to 10 6 -fold 
15 and 0.5 |al of extract containing a 5' nuclease (prepared as described above) are added. 

The predicted melting temperature for the alpha or trigger primer is 60°C in the above 
buffer. Annealing is performed just below this predicted melting temperature at 55°C. 
Using a Perkin Elmer DNA Thermal Cycler, the reactions are annealed at 55°C for 30 
seconds. The temperature is then increased slowly over a five minute period to 72°C 
20 to allow for cleavage. After cleavage, the reactions are rapidly brought to 55°C (1°C 

per second) to allow another cycle of annealing to occur. A range of cycles are 
performed (20, 40 and 60 cycles) and the reaction products are analyzed at each of 
these number of cycles. The number of cycles which indicates that the accumulation 
of cleaved hairpin products has not reached a plateau is then used for subsequent 
25 determinations when it is desirable to obtain a quantitative result. 

Following the desired number of cycles, the reactions are stopped at 55°C by 
the addition of 8 p.1 of 95% formamide with 20 mM EDTA and 0.05% marker dyes 
per 10 |al reaction volume. Samples are not heated before loading onto denaturing 
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polyacrylamide gels (10% polyacrylamide, 19:1 crosslinking, 7 M urea, 89 mM tris- 
borate, pH 8.3, 2.8 mM EDTA). The samples were not heated to allow for the 
resolution of single-stranded and re-duplexed uncleaved hairpin molecules. 

The hairpin molecules may be attached to separate solid support molecules, 
5 such as agarose, styrene or magnetic beads, via the 3' end of each hairpin. A spacer 

molecule may be placed between the 3' end of the hairpin and the bead if so desired. 
The advantage of attaching the hairpins to a solid support is that this prevents the 
hybridization of the A- and T-hairpins to one another during the cycles of melting and 
annealing. The A- and T-hairpins are complementary to one another (as shown in 
10 Figure 20D) and if allowed to anneal to one another over their entire lengths this 

would reduce the amount of hairpins available for hybridization to the alpha and tau 
primers during the detection reaction. 

The 5' nucleases of the present invention are used in this assay because they 
lack significant synthetic activity. The lack of synthetic activity results in the 
15 production of a single cleaved hairpin product (as shown in Figure 19B, lane 4). 

Multiple cleavage products may be generated by 1) the presence of interfering 
synthetic activity (see Figure 19B, lanes 6 and 8) or 2) the presence of primer- 
independent cleavage in the reaction. The presence of primer-independent cleavage is 
detected in the trigger/detection assay by the presence of different sized products at the 
20 fork of the cleavage structure. Primer-independent cleavage can be dampened or 

repressed, when present, by the use of uncleavable nucleotides in the fork region of the 
hairpin molecule. For example, thiolated nucleotides can be used to replace several 
nucleotides at the fork region to prevent primer-independent cleavage. 

EXAMPLE 5 

25 Cleavage Of Linear Nucleic Acid Substrates 

From the above, it should be clear that native (i.e., "wild type") thermostable 
DNA polymerases are capable of cleaving hairpin structures in a specific manner and 
that this discovery can be applied with success to a detection assay. In this example, 
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the mutant DNAPs of the present invention are tested against three different cleavage 
structures shown in Figure 22A. Structure 1 in Figure 22A is simply single stranded 
206-mer (the preparation and sequence information for which was discussed above). 
Structures 2 and 3 are duplexes; structure 2 is the same hairpin structure as shown in 
5 Figure 12A (bottom), while structure 3 has the hairpin portion of structure 2 removed. 

The cleavage reactions comprised 0.01 pmoles of the resulting substrate DNA, 
and 1 pmole of pilot oligonucleotide in a total volume of 10 ^1 of 10 mM Tris-Cl, pH 
8.3, 100 mM KC1, 1 mM MgCl 2 . Reactions were incubated for 30 minutes at 55°C, 
and stopped by the addition of 8 ul of 95% formamide with 20 mM EDTA and 0.05% 
10 marker dyes. Samples were heated to 75°C for 2 minutes immediately before 

electrophoresis through a 10% polyacrylamide gel (19:1 cross link), with 7M urea, in a 
buffer of 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

The results were visualized by autoradiography and are shown in Figure 22B 
with the enzymes indicated as follows: I is native Tag DNAP; II is native Tfl DNAP; 
15 III is the Cleavase™ BX enzyme shown in Figure 4E; IV is the Cleavase™ BB 

enzyme shown in Figure 4F; V is the mutant shown in Figure 5B; and VI is the 
Cleavase™ BN enzyme shown in Figure 4G. Structure 2 was used to "normalize" the 
comparison. For example, it was found that it took 50 ng of Tag DNAP and 300 ng 
of the Cleavase™ BN enzyme to give similar amounts of cleavage of Structure 2 in 
20 thirty (30) minutes. Under these conditions native Tag DNAP is unable to cleave 

Structure 3 to any significant degree. Native Tfl DNAP cleaves Structure 3 in a 
manner that creates multiple products. 

By contrast, all of the mutants tested cleave the linear duplex of Structure 3. 
This finding indicates that this characteristic of the mutant DNA polymerases is 
25 consistent of thermostable polymerases across thermophilic species. 

The finding described herein that the mutant DNA polymerases of the present 
invention are capable of cleaving linear duplex structures allows for application to a 
more straightforward assay design (Figure 1A). Figure 23 provides a more detailed 
schematic corresponding to the assay design of Figure 1A. 
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The two 43-mers depicted in Figure 23 were synthesized by standard methods. 
Each included a fluorescein on the 5 'end for detection purposes and a biotin on the 
3 'end to allow attachment to streptavidin coated paramagnetic particles (the biotin- 
avidin attachment is indicated by " ")• 
5 Before the trityl groups were removed, the oligos were purified by HPLC to 

remove truncated by-products of the synthesis reaction. Aliquots of each 43-mer were 
bound to M-280 DYNABEADS (Dynal) at a density of 100 pmoles per mg of beads. 
Two (2) mgs of beads (200 ul) were washed twice in IX wash/bind buffer (1 M NaCl, 
5 mM Tris-Cl, pH 7.5, 0.5 mM EDTA) with 0.1% BSA, 200 ul per wash. The beads 
10 were magnetically sedimented between washes to allow supernatant removal. After the 

second wash, the beads were resuspended in 200 ul of 2X wash/bind buffer (2 M Na 
CI, 10 mM Tris-Cl, pH 7.5 with 1 mM EDTA), and divided into two 100 ul aliquots. 
Each aliquot received 1 ul of a 100 uM solution of one of the two oligonucleotides. 
After mixing, the beads were incubated at room temperature for 60 minutes with 
15 occasional gentle mixing. The beads were then sedimented and analysis of the 

supernatants showed only trace amounts of unbound oligonucleotide, indicating 
successful binding. Each aliquot of beads was washed three times, 100 ul per wash, 
with IX wash/bind buffer, then twice in a buffer of 10 mM Tris-Cl, pH 8.3 and 75 
mM KC1. The beads were resuspended in a final volume of 100 ul of the Tns/KCl, 
20 for a concentration of 1 pmole of oligo bound to 10 ug of beads per ul of suspension. 

The beads were stored at 4°C between uses. 

The types of beads correspond to Figure 1A. That is to say, type 2 beads 
contain the oligo (SEQ ID NO:33) comprising the complementary sequence (SEQ ID 
NO:34) for the alpha signal oligo (SEQ ID NO:35) as well as the beta signal oligo 
25 (SEQ ID NO:36) which when liberated is a 24-mer. This oligo has no "As" and is "T" 

rich. Type 3 beads contain the oligo (SEQ ID NO:37) comprising the complementary 
sequence (SEQ ID NO:38) for the beta signal oligo (SEQ ID NO:39) as well as the 
alpha signal oligo (SEQ ID NO:35) which when liberated is a 20-mer. This oligo has 
no "Ts" and is "A" rich. 
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Cleavage reactions comprised 1 ul of the indicated beads, 10 pmoles of 
unlabeled alpha signal oligo as "pilot" (if indicated) and 500 ng of the Cleavase™ BN 
enzyme in 20 ul of 75 mM KC1, 10 mM Tris-Cl, P H 8.3, 1.5 mM MgCl 2 and 10 uM 
CTAB. All components except the enzyme were assembled, overlaid with light 
5 mineral oil and warmed to 53°C The reactions were initiated by the addition of 

prewarmed enzyme and incubated at that temperature for 30 minutes. Reactions were 
stopped at temperature by the addition of 16 ^1 of 95% formamide with 20 mM 
EDTA and 0.05% each of bromophenol blue and xylene cyanol. This addition stops 
the enzyme activity and, upon heating, disrupts the biotin-avidin link, releasing the 
10 majority (greater than 95%) of the oligos from the beads. Samples were heated to 

75°C for 2 minutes immediately before electrophoresis through a 10% polyacrylamide 
gel (19:1 cross link), with 7 M urea, in a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 
mM EDTA. Results were visualized by contact transfer of the resolved DNA to 
positively charged nylon membrane and probing of the blocked membrane with an 
15 anti-fluorescein antibody conjugated to alkaline phosphatase. After washing, the signal 

was developed by incubating the membrane in Western Blue (Promega) which deposits 
a purple precipitate where the antibody is bound. 

Figure 24 shows the propagation of cleavage of the linear duplex nucleic acid 
structures of Figure 23 by the DNAP mutants of the present invention. The two center 
20 lanes contain both types of beads. As noted above, the beta signal oligo (SEQ ID 

NO:36) when liberated is a 24-mer and the alpha signal oligo (SEQ ID NO:35) when 
liberated is a 20-mer. The formation of the two lower bands corresponding to the 24- 
mer and 20-mer is clearly dependent on "pilot". 

EXAMPLE 6 

25 5' Exonucleolytic Cleavage ("Nibbling") By Thermostable DNAPs 

It has been found that thermostable DNAPs, including those of the present 
invention, have a true 5' exonuclease capable of nibbling the 5' end of a linear duplex 
nucleic acid structures. In this example, the 206 base pair DNA duplex substrate is 
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again employed (see above). In this case, it was produced by the use of one 32 P- 
labeled primer and one unlabeled primer in a polymerase chain reaction. The cleavage 
reactions comprised 0.01 pmoles of heat-denatured, end-labeled substrate DNA (with 
the unlabeled strand also present), 5 pmoles of pilot oligonucleotide (see pilot oligos in 
Figure 12A) and 0.5 units of DNAPTaq or 0.5 p. of the Cleavase™ BB enzyme in the 
E. coli extract (see above), in a total volume of 10 pi of 10 mM Tris-Cl, pH 8.5, 50 

mM KC1, 1.5 mM MgCl 2 . 

Reactions were initiated at 65°C by the addition of pre-warmed enzyme, then 
shifted to the final incubation temperature for 30 minutes. The results are shown in 
Figure 25 A. Samples in lanes 1-4 are the results with native Tag DNAP, while lanes 
5-8 shown the results with the Cleavase™ BB enzyme. The reactions for lanes 1, 2, 
5, and 6 were performed at 65°C and reactions for lanes 3, 4, 7, and 8 were performed 
at 50°C and all were stopped at temperature by the addition of 8 pi of 95% formamide 
with 20 mM EDTA and 0.05% marker dyes. Samples were heated to 75°C for 2 
minutes immediately before electrophoresis through a 10% acrylamide gel (19:1 cross- 
linked), with 7 M urea, in a buffer of 45 mM Tris'Borate, pH 8.3, 1.4 mM EDTA. 
The expected product in reactions 1, 2, 5, and 6 is 85 nucleotides long; in reactions 3 
and 7, the expected product is 27 nucleotides long. Reactions 4 and 8 were performed 
without pilot, and should remain at 206 nucleotides. The faint band seen at 24 
nucleotides is residual end-labeled primer from the PCR. 

The surprising result is that the enzyme Cleavase™ BB under these conditions 
causes all of the label to appear in a very small species, suggesting the possibility that 
the enzyme completely hydrolyzed the substrate. To determine the composition of the 
fastest-migrating band seen in lanes 5-8 (reactions performed with the deletion 
mutant), samples of the 206 base pair duplex were treated with either T7 gene 6 
exonuclease (USB) or with calf intestine alkaline phosphatase (Promega), according to 
manufacturers' instructions, to produce either labeled mononucleotide (lane a of Figure 
25B) or free 32 P-labeled inorganic phosphate (lane b of Figure 25B), respectively. 
These products, along with the products seen in lane 7 of panel A were resolved by 
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brief electrophoresis through a 20% acrylamide gel (19:1 cross-link), with 7 M urea, ii 
a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. The enzyme Cleavase™ BB 
is thus capable of converting the substrate to mononucleotides. 



The nibbling by the Cleavase™ BB enzyme is duplex dependent. In this 
example, internally labeled, single strands of the 206-mer were produced by 15 cycles 
of primer extension incorporating cc- 32 P labeled dCTP combined with all four 
unlabeled dNTPs, using an unlabeled 206-bp fragment as a template. Single and 
double stranded products were resolved by electrophoresis through a non-denaturing 
6% polyacrylamide gel (29:1 cross-link) in a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 
mM EDTA, visualized by autoradiography, excised from the gel, eluted by passive 
diffusion, and concentrated by ethanol precipitation. 

The cleavage reactions comprised 0.04 pmoles of substrate DNA, and 2 ul of 
the Cleavase™ BB enzyme (in an E. coli extract as described above) in a total volume 
of 40 |il of 10 mM Tris'Cl, pH 8.5, 50 mM KC1, 1.5 mM MgCl 2 . Reactions were 
initiated by the addition of pre-warmed enzyme; 10 ^1 aliquots were removed at 5, 10, 
20, and 30 minutes, and transferred to prepared tubes containing 8 ul of 95% 
formamide with 30 mM EDTA and 0.05% marker dyes. Samples were heated to 75°C 
for 2 minutes immediately before electrophoresis through a 10% acrylamide gel (19:1 
cross-linked), with 7 M urea, in a buffer of 45 mM Tris»Borate, pH 8.3, 1.4 mM 
EDTA. Results were visualized by autoradiography as shown in Figure 26. Clearly, 
the cleavage by the Cleavase™ BB enzyme depends on a duplex structure; no cleavage 
of the single strand structure is detected whereas cleavage of the 206-mer duplex is 
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complete. 
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EXAMPLE 8 

Nibbling Can Be Target Directed 



The nibbling activity of the DNAPs of the present invention can be employed 
with success in a detection assay. One embodiment of such an assay is shown in 
Figure 27. In this assay, a labelled oligo is employed that is specific for a target 
sequence. The oligo is in excess of the target so that hybridization is rapid. In this 
embodiment, the oligo contains two fluorescein labels whose proximity on the oligo 
causes their emission to be quenched. When the DNAP is permitted to nibble the 
oligo the labels separate and are detectable. The shortened duplex is destabilized and 
disassociates. Importantly, the target is now free to react with an intact labelled oligo. 
The reaction can continue until the desired level of detection is achieved. An 
analogous, although different, type of cycling assay has been described employing 
lambda exonuclease. See C.G. Copley and C. Boot, BioTechniques 13:888 (1992). 

The success of such an assay depends on specificity. In other words, the oligo 
must hybridize to the specific target. It is also preferred that the assay be sensitive; 
the oligo ideally should be able to detect small amounts of target. Figure 28A shows a 
5 '-end 32 P-labelled primer bound to a plasmid target sequence. In this case, the 
plasmid was pUC19 (commercially available) which was heat denatured by boiling two 
(2) minutes and then quick chilling. The primer is a 21-mer (SEQ ID NO:39). The 
Cleavase™ BX enzyme (a dilution equivalent to 5 x 10" 3 ul extract) was employed in 
100 mM KC1, 10 mM Tris-Cl, pH 8.3, 2 mM MnCl 2 . The reaction was performed at 
55°C for sixteen (16) hours with or without genomic background DNA (from chicken 
blood). The reaction was stopped by the addition of 8 ul of 95% formamide with 20 
mM EDTA and marker dyes. 

The products of the reaction were resolved by PAGE (10% polyacrylamide, 
19:1 cross link, 1 x TBE) as seen in Figure 28B. Lane "NT contains the labelled 21- 
mer. Lanes 1-3 contain no specific target, although Lanes 2 and 3 contain 100 ng and 
200 ng of genomic DNA, respectively. Lanes 4, 5 and 6 all contain specific target 
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with either 0 ng, 100 ng or 200 ng of genomic DNA, respectively. It is clear that 
conversion to mononucleotides occurs in Lanes 4, 5 and 6 regardless of the presence 
or amount of background DNA. Thus, the nibbling can be target directed and specific. 

EXAMPLE 9 

5 Purification Of The Cleavase™ Enzyme 

As noted above, expressed thermostable proteins, i.e., the 5' nucleases, were 
isolated by crude bacterial cell extracts. The precipitated E. coli proteins were then, 
along with other cell debris, removed by centrifugation. In this example, cells 
expressing the BN clone were cultured and collected (500 grams). For each gram (wet 
10 weight) of E. coli, 3ml of lysis buffer (50 mM Tris-HCl, pH 8.0, 1 mM EDTA, 

100p.M NaCl) was added. The cells were lysed with 200 ug/ml lysozyme at room 
temperature for 20 minutes. Thereafter deoxycholic acid was added to make a 0.2% 
final concentration and the mixture was incubated 15 minutes at room temperature. 

The lysate was sonicated for approximately 6-8 minutes at 0°C. The precipitate 
15 was removed by centrifugation (39,000g for 20 minutes). Polyethyleneimine was 

added (0.5%) to the supernatant and the mixture was incubated on ice for 15 minutes. 
The mixture was centrifuged (5,000g for 15 minutes) and the supernatant was retained. 
This was heated for 30 minutes at 60°C and then centrifuged again (5,000g for 15 
minutes) and the supernatant was again retained. 
20 The supernatant was precipitated with 35% ammonium sulfate at 4°C for 15 

minutes. The mixture was then centrifuged (5,000g for 15 minutes) and the 
supernatant was removed. The precipitate was then dissolved in 0.25 M KC1, 20 mM 
Tris, pH 7.6, 0.2% TWEEN and 0.1 EDTA) and then dialyzed against Binding Buffer 
(8X Binding Buffer comprises: 40mM imidazole, 4M NaCl, 160 mM Tris-HCl, 
25 pH 7.9). 
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The solubilized protein is then purified on the Ni" column (Novagen). The 
Binding Buffer is allows to drain to the top of the column bed and load the column 
with the prepared extract. A flow rate of about 10 column volumes per hour is 
optimal for efficient purification. If the flow rate is too fast, more impurities will 
contaminate the eluted fraction. 

The column is washed with 25 ml (10 volumes) of IX Binding Buffer and then 
washed with 15 ml (6 volumes) of IX Wash Buffer (8X Wash Buffer comprises: 
480mM imidazole, 4M NaCl, 160 mM Tris-HCl, P H 7.9). The bound protein was 
eluted with 15ml (6 volumes) of IX Elute Buffer (4X Elute Buffer comprises: 4mM 
imidazole, 2M NaCl, 80 mM Tris-HCl, P H 7.9). Protein is then reprecipitated with 
35% Ammonium Sulfate as above. The precipitate was then dissolved and dialyzed 
against: 20 mM Tris, 100 mM KC1, ImM EDTA). The solution was brought up to 
0.1% each of TWEEN 20 and NP-40 and stored at 4°C. 

EXAMPLE 10 

5' Nucleases Cut Nucleic Acid Substrates At 
Naturally Occurring Areas Of Secondary Structure 



The ability of a 5' nuclease to recognize and cleave nucleic acid substrates at 
naturally occurring areas of secondary structure in the absence of a pilot 
oligonucleotide {i.e., primer independent cleavage) was shown in Example 1C 
20 (Figure 12, lane 9). When DNAPTaq was incubated at 50°C in the presence of a 206 

bp DNA substrate (single end labeled, double stranded template) in a buffer containing 
10 mM Tris-HCl, pH 8.5 and 1.5 mM MgCl 2 , adventitious {i.e., naturally occurring) 
structures in the DNA substrate were cleaved by the 5' nuclease activity of the 
enzyme. This cleavage generated three prominent fragments (Figure 12, lane 9); this 
25 cleavage pattern provides a "fingerprint" of the DNA template. 

The ability of 5' nucleases to cleave naturally occurring structures in nucleic 
acid templates (structure-specific cleavage) is useful to detect internal sequence 
differences in nucleic acids without prior knowledge of the specific sequence of the 
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nucleic acid. To develop a general method to scan nucleic acids for mutations [e.g., 
single base changes (point mutations), small insertions or deletions, etc.] using 5' 
nucleases, the following series of experiments were performed. 

A. The Substitution Of MnCl 2 For MgCl 2 In The 
5 Cleavage Reaction Produces Enhanced Cleavage 

Patterns 

The effect of substituting of Mn 2+ in place of Mg 2+ upon the cleavage pattern 
created by 5' nuclease activity on a double-stranded DNA substrate was examined. A 
157 bp fragment derived from exon 4 of either the wild-type (SEQ ID NO:40) or the 
10 mutant G419R (SEQ ID NO:41) tyrosinase gene was prepared by PCR as follows. 

The primer pair 5' biotin-C ACCGT CCTCTTC AAG AAG 3' (SEQ ID NO:42) 
and 5' fluorescein-CTGAATCTTGTAGATAGCTA 3' (SEQ ID NO:43) was used to 
prime the PCRs. The synthetic primers were obtained from Promega; the primers 
were labeled on the 5' end with biotin or fluorescein during synthesis. 
15 The target DNA for the generation of the 157 bp fragment of mutant G419R 

(King, R.A., et al, (1991) Mol. Biol. Med. 8:19; here after referred to as the 419 
mutant) was a 339 bp PCR product (SEQ ID NO:44) generated using genomic DNA 
homozygous for the 419 mutation. Genomic DNA was isolated using standard 
techniques from peripheral blood leukocytes isolated from patients. This 339 bp PCR 
20 product was prepared as follows. 

The symmetric PCR reaction comprised 10 ng of genomic DNA from the 419 
mutant, 100 pmoles of the primer 5' biotin-GCCTT ATTTT ACTTT AAAAAT-3 ' (SEQ 
ID NO:45), 100 pmoles of the primer 5' fluorescein-TAAAGTTTTGTGTTATCTCA- 
3' (SEQ ID NO:46), 50 uM of each dNTP, 20 mM Tris-Cl, pH 8.3, 1.5 mM MgCl 2 , 
25 50 mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET P-40 (NP40). The 

primers of SEQ ID NOS:45 and 46 were obtained from Integrated DNA Technologies, 
Coralville, IA. A tube containing 45 ul of the above mixture was overlaid with two 
drops of light mineral oil and the tube was heated to 95°C for 1 min. Tag polymerase 
was then added as 1.25 units of enzyme in 5 ul of 20 mM Tris-Cl, pH 8.3, 1.5 mM 
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MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET P-40. The tube 
was heated to 94°C for 40 sec, cooled to 55°C for 50 sec, heated to 72°C for 70 sec 
for 29 repetitions with a 5 min incubation at 72°C after the last repetition. 

The PCR products were gel purified as follows. The products were resolved by 
5 electrophoresis through a 6% polyacrylamide gel (29:1 cross-link) in a buffer 

containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. The DNA was visualized by 
ethidium bromide staining and the 339 bp fragment was excised from the gel. The 
DNA was eluted from the gel slice by passive diffusion overnight into a solution 
containing 0.5 M NH 4 OAc, 0.1% SDS and 0.1 M EDTA. The DNA was then 
10 precipitated with ethanol in the presence of 4 jxg of glycogen carrier. The DNA was 

pelleted and resuspended in 40 ul of TE (10 mM Tris-Cl, pH 8.0, 0.1 mM EDTA). 

To generate the 157 bp fragment from the 419 mutant, the purified 339 bp 419 
PCR fragment was used as the target in an asymmetric PCR. The asymmetric PCR 
comprised 100 pmoles of the biotinylated primer of SEQ ID NO:45, 1 pmole of the 
15 fluoresceinated primer of SEQ ID NO:46, 50 uM of each dNTP, 20 mM Tris-Cl, pH 

8.3, 1.5 mM MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET P- 
40. A tube containing 45 ul of the above mixture was overlaid with two drops of 
light mineral oil and the tube was heated to 95°C for 5 sec and then cooled to 70°C. 
Tag polymerase was then added as 1.25 units of enzyme in 5 »\ of 20 mM Tris-Cl, pH 
20 8.3, 1.5 mM MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET P- 

40.' The tube was heated to 95°C for 45 sec, cooled to 50*C for 45 sec, heated to 
72°C for 1 min 15 sec for 30 repetitions with a 5 min incubation at 72°C after the 
last repetition. 

The asymmetric PCR products were gel purified as follows. The products were 
25 resolved by electrophoresis through a 6% polyacrylamide gel (29:1 cross-link) in a 

buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. The DNA was 
visualized by ethidium bromide staining; the double-stranded DNA was differentiated 
from the single- stranded DNA due to the mobility shift commonly seen with single- 
stranded DNA produced from asymmetric PCR (In an asymmetric PCR both single- 
30 stranded and double-stranded products are produced; typically the single-stranded 
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product will have a slower speed of migration through the gel and will appear closer 
to the origin than will the double- stranded product). The double-stranded 157 bp 
substrate corresponding to the 419 mutant (SEQ ID NO:41) was excised from the gel. 
The 157 bp wild-type fragment was generated by asymmetric PCR as described 
5 above for the 419 mutant with the exception that the target DNA was 10 ng of 

supercoiled pcTYR-NlTyr plasmid DNA. The pcTYR-NlTyr plasrmd contains the 
entire wild-type tyrosinase cDNA [Geibel, L.B., et al. (1991) Genomics 9:435]. 

Following the asymmetric PCRs, the reaction products were resolved on an 
acrylamide gel and the double-stranded fragments of interest were excised, eluted and 
10 precipitated as described above. The precipitated 157 bp wild-type (SEQ ID NO:40) 

and 419 mutant (SEQ ID NO:41) fragments were resuspended in 40 ul of TE. 

Cleavage reactions comprised 100 fmoles of the resulting double-stranded 
substrate DNAs (the substrates contain a biotin moiety at the 5' end of the sense 
strand) in a total volume of 10 ul of 10 mM MOPS, pH 8.2, 1 mM divalent cation 
15 (either MgCl 2 or MnCl 2 ) and 1 unit of DNAPTaq. The reactions were overlaid with a 

drop of light mineral oil. Reactions were heated to 95°C for 5 seconds to denature the 
substrate and then the tubes were quickly cooled to 65°C (this step allows the DNA 
assume its unique secondary structure by allowing the formation of intra-strand 
hydrogen bonds between complimentary bases). The reaction can be performed in 
20 either a thermocycler (MJ Research, Watertown, MA) programmed to heat to 95°C for 

5 seconds then drop the temperature immediately to 65°C or alternatively the tubes can 
be placed manually in a heat block set at 95°C and then transferred to a second heat 
block set at 65°C. 

The reaction was incubated at 65°C for 10 minutes and was stopped by the 
25 addition of 8ul of stop buffer (95% formamide containing 20 mM EDTA and 0.05% 

each xylene cyanol and bromophenol blue). Samples were heated to 72°C for 2 
minutes and 5ul of each reaction were resolved by electrophoresis through a 10% 
polyacrylamide gel (19:1 cross-link), with 7 M urea, in a buffer containing 45 mM 
Tris-Borate, pH 8.3, 1.4 mM EDTA. 
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After electrophoresis, the gel plates were separated allowing the gel to remain 
flat on one plate. A 0.2 ^m-pore positively-charged nylon membrane (Schleicher and 
Schuell, Keene, NH), pre-wetted in 0.5X TBE (45 mM Tris-Borate, P H 8.3, 1.4 mM 
EDTA), was laid on top of the exposed acrylamide gel. All air bubbles trapped 
between the gel and the membrane were removed. Two pieces of 3MM filter paper 
(Whatman) were then placed on top of the membrane, the other glass plate was 
replaced, and the sandwich was clamped with binder clips. Transfer was allowed to 
proceed overnight. After transfer, the membrane was carefully peeled from the gel 
and allowed to air dry. After complete drying, the membrane was washed in 1.2X 
Sequenase Images Blocking Buffer (United States Biochemical) for 30 minutes. Three 
tenths of a ml of the buffer was used per cm 2 of membrane. A streptavidin-alkaline 
phosphatase conjugate (SAAP, United States Biochemical) was added to a 1:4000 
dilution directly to the blocking solution, and agitated for 15 minutes. The membrane 
was rinsed briefly with H 2 0 and then washed 3 times (5 minutes/wash) in IX SAAP 
buffer (100 mM Tris-HCL, pH 10; 50 mM NaCl) with 0.1% sodium dodecyl sulfate 
(SDS) using 0.5 ml buffer/cm 2 of the buffer, with brief H 2 0 rinses between each wash. 
Similarly, for fluorescein-labeled DNA, anti-fluorescein fragment (Boehringer 
Mannheim Biochemicals, Indianapolis, IN) at a 1:20,000 final dilution may be added 
followed by three washes (5 min/wash) in IX SAAP buffer containing 0.1% SDS and 
0.025% TWEEN 20. The membrane was then washed once in IX SAAP buffer 
without SDS, drained thoroughly and placed in a plastic heat-sealable bag. Using a 
sterile pipet tip, 0.05 ml/cm 2 of CDP-Star™ (Tropix, Bedford, MA) was added to the 
bag and distributed over the entire membrane for 5 minutes. The bag was drained of 
all excess liquid and air bubbles. The membrane was then exposed to X-ray film 
25 (Kodax XRP) for an initial 30 minutes. Exposure times were adjusted as necessary for 

resolution and clarity. The results are shown in Figure 30. 

In Figure 30, the lane marked "M" contains molecular weight markers. The 
marker fragments were generated by digestion of P UC19 with Haelll followed by the 
addition of biotinylated dideoxynucleotides (Boehringer Mannheim, Indianapolis, IN) 
30 to the cut ends using terminal transferase (Promega). Lanes 1, 3 and 5 contain the 
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reaction products from the incubation of the wild type 157 nucleotide substrate in the 
absence of the DNAP Taq enzyme (lane 1), in the presence of MgCl 2 and enzyme (lane 
3) or in the presence of MnCl 2 and enzyme (lane 5). Lanes 2, 4 and 6 contains the 
reaction products from the incubation of the 157 nucleotide substrate derived from the 
5 419 mutant in the absence of enzyme (lane 2), in the presence of MgCl 2 and enzyme 

(lane 4) or in the presence of MnCl 2 and enzyme (lane 6). 

Figure 30 demonstrates that the use of MnCl 2 rather than MgCl 2 in the cleavage 
reaction results in the production of an enhanced cleavage pattern. It is desirable that 
the cleavage products are of different sizes so that the products do not all cluster at 
10 one end of the gel. The ability to spread the cleavage products out over the entire 

length of the gel makes it more likely that alterations in cleavage products between the 
wild type and mutant substrates will be identified. Figure 30 shows that when Mg 2+ is 
used as the divalent cation, the majority of the cleavage products cluster together in 
the upper portion of the gel. In contrast when Mn 2+ is used as the divalent cation, the 
15 substrate assumes structures which, when cleaved, generate products of widely 

differing mobilities. These results show that Mn 2+ is the preferred divalent cation for 
the cleavage reaction. 

B. 5' Nuclease Cleavage Of Different But Similarly Sized 
DNAs Generates Unique Cleavage Fragments 

20 The ability of 5' nuclease to generate a cleavage pattern or "fingerprint" which 

is unique to a given piece of DNA was shown by incubating four similarly sized DNA 
substrates with the Cleavase™ BN enzyme. The four DNA substrates used were a 157 
nucleotide fragment from the sense (or coding) strand of exon 4 of the wild-type 
tyrosinase gene (SEQ ID NO:47); a 157 nucleotide fragment from the anti-sense (or 

25 non-coding) strand of exon 4 of the wild-type tyrosinase gene (SEQ ID NO:48); a 165 

nucleotide DNA fragment derived from P GEM3Zf(+) (SEQ ID NO:49) and a 206 
nucleotide DNA fragment derived from the bottom strand of pGEM3Zf(+) (SEQ ID 
NO:50). The DNA substrates contained either a biotin or fluorescein label at their 5' 
or 3' ends. The substrates were made as follows. 
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To produce the sense and anti-sense single-stranded substrates corresponding to 
exon 4 of the wild-type tyrosinase gene, a double-stranded DNA fragment, 157 
nucleotides in length (SEQ ID NO:40), was generated using symmetric PCR. The 
target for the symmetric PCR was genomic DNA containing the wild-type tyrosinase 
gene. The symmetric PCR comprised 50-100 ng of genomic wild-type DNA, 25 
pmoles each of primers SEQ ID NOS:42 and 43, 50 uM each dNTP and 1.25 units of 
Taq polymerase in 50 ul of 20 mM Tris-Cl, P H 8.3, 1.5 mM MgCl 2 , 50 mM KC1, 
with 0.05% TWEEN 20 and 0.05% NONIDET P-40. The reaction mixture was 
overlaid with two drops of light mineral oil and the tube was heated to 94°C for 30 
sec, cooled to 50°C for 1 min, heated to 72°C for 2 min for 30 repetitions. The 
double-stranded PCR product was gel purified, precipitated and resuspended in 40 ul 
of TE buffer as described above in a). 

The single-stranded sense and anti-sense 157 nucleotide DNA fragments were 
generated using the above 157 bp wild-type DNA fragment (SEQ ID NO:40) in two 
asymmetric PCR reactions. The sense strand fragment was generated using 5 ul of the 
above purified 157 bp fragment (SEQ ID NO:40) as the target in an asymmetric PCR. 
The reaction mixtures for the asymmetric PCR were as above for the symmetric PCR 
with the exception that 100 pmoles of the biotin-labeled sense primer (SEQ ID NO:42) 
and 1 pmole of the fluorescein-labeled anti-sense primer (SEQ ID NO:43) was used to 
prime the reaction. The anti-sense fragment was generated using 5 ul of the above 
purified 157 bp fragment as the target in an asymmetric PCR. The reaction conditions 
for the asymmetric PCR were as above for the symmetric PCR with the exception that 
1 pmole of the sense primer (SEQ ID NO:42) and 100 pmoles of the anti-sense primer 
(SEQ ID NO:43) was used to prime the reaction. 

The reaction conditions for the asymmetric PCR were 95°C for 45 sec, 50°C 
for 45 sec, 72°C for 1 min and 15 sec for 30 repetitions with a 5 min incubation at 
72°C after the last repetition. The reaction products were visualized, extracted and 
collected as described above with the single stranded DNA being identified by a shift 
in mobility when compared to a double stranded DNA control. 
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The single-stranded 165 nucleotide fragment from pGEM3Zf(+) (SEQ ID 
NO:49) was generated by asymmetric PCR. The PCR comprised 50 pmoles of 5' 
biotin- AGCGG AT AAC AATTTC AC AC AGG A- 3 ' (SEQ ID NO:51; Promega) and 1 
pmole of 5 ' -C ACGGATCCT AAT ACGACTCACT AT AGGG-3 ' (SEQ ID NO:52; 
Integrated DNA Technologies, Coralville, IA), 50 uM each dNTP, 20 mM Tris-Cl, 
P H 8.3, 1.5 mM MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET 
P-40. Forty-five microliters of this reaction mixture was overlaid with two drops of 
light mineral oil and the tube was heated to 95°C for 5 sec and then cooled to 70°C. 
Tag polymerase was then added at 1.25 units in 5 ul of 20 mM Tris-Cl, pH 8.3, 1.5 
mM MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET P-40. The 
tubes were heated to 95°C for 45 sec, cooled to 50°C for 45 sec, heated to 72°C for 
1 min 15 sec for 30 repetitions with a 5 min incubation at 72°C after the last 
repetition. The reaction products were visualized, extracted and collected as described 
above with the 164 nucleotide DNA fragment being identified by a shift in mobility 
when compared to a double stranded DNA control. 

The 206 nucleotide DNA fragment (SEQ ID NO:50) was prepared by 
asymmetric as follows. The asymmetric PCR comprised 1 pmole of a double-stranded 
206 bp PCR product (generated as described in Example 1C), 50 pmoles of the primer 
5 ' -CGCC AGGGTTTTCCC AGTCACGAC-3 ' (SEQ ID NO:53), 50 uM each dNTP, 20 
mM Tris-Cl, pH 8.3, 1.5 mM MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% 
NONIDET P-40. Ninety-five microliters of this reaction mixture was overlaid with 
three drops of light mineral oil and the tube was heated to 95°C for 5 sec and then 
cooled to 70°C. Tag polymerase was then added at 2.5 units in 5 ul of 20 mM Tris- 
Cl, pH 8.3, 1.5 mM MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% 
NONIDET P-40. The tubes were heated to 95°C for 45 sec, cooled to 63°C for 45 
sec, heated to 72°C for 1 min 15 sec for 15 repetitions with a 5 min incubation at 
72°C after the last repetition. The reaction products were visualized, extracted and 
collected as described above with the 206 nucleotide DNA fragment being identified 
by a shift in mobility when compared to a double stranded DNA control. The 
precipitated DNA was resuspended in 70 ul of TE buffer. 
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Twenty-five microliters of the above product was biotinylated on the 3' end 
using 10-20 units of terminal deoxynucleotidyl transferase (Promega) in a 50 ul 
reaction. The reaction comprised 0.5 nmoles of biotin-16-ddUTP (Boehringer 
Mannheim) and IX buffer (500 mM cacoodylate buffer, pH 6.8, 5 mM CoCl 2 , 0.5 mM 
DTT and 500 p.g/ml BSA). The tubes were incubated at 37°C for 15 min followed by 
ethanol precipitation in the presence of 4 jig of glyc7ogen. The DNA was ethanol 
precipitated a second time and then resuspended in 25 ul of 10 Mm Tris-HCl, pH 8.0, 
0.1 mM EDTA. 

The cleavage reactions were carried out in a final volume of 10 ul containing 
IX CFLP™ buffer (10 mM MOPS, pH 8.2) with 1 mM MnCl 2 using approximately 
100 fmoles of substrate DNA and 250 ng of the Cleavase™ BN enzyme. Parallel 
reactions lacking the Cleavase™ BN enzyme (no enzyme control) were set up as 
above with the exception that one third as much DNA template was used 
(approximately 33 fmoles of each template) to balance the signal on the 

15 autoradio graph. 

Each substrate DNA was placed in a 200 ul thin wall microcentrifuge tube 
(BioRad, Hercules, CA) in 5ul of IX CFLP™ buffer with 2 mM MnCl 2 . The solution 
was overlaid with one drop of light mineral oil. Tubes were brought to 95°C for 5 
seconds to denature the substrates and then the tubes were quickly cooled to 65°C. 

20 Cleavage reactions were started immediately by the addition of a diluted 

enzyme mixture comprising 1 ul of the Cleavase™ BN enzyme [250 ng/ul in IX 
dilution buffer (0.5% NP40, 0.5% TWEEN20, 20 mM Tris-HCl, P H 8.0, 50 mM KC1, 
10 ug/ml BSA)] in 5ul of IX CFLP™ buffer without MnCl 2 . The enzyme solution 
was at room temperature before addition to the cleavage reaction. After 5 minutes at 

25 65°C, the reactions were stopped by the addition of 8ul of stop buffer. Samples were 

heated to 72°C for 2 minutes and 5^1 of each reaction were resolved by 
electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), with 7 M urea, in 
a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 
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After electrophoresis, the gel plates were separated and overlaid with a 0.45 
Hm-pore positively charged nylon membrane (United States Biochemical). The DNA 
was transferred to the membrane and the membrane was dried, washed in 1.2X 
Sequenase IMAGES Blocking Buffer, treated with IX SAAP buffer as described 
above. The signal was developed using LUMIPHOS-530 (United States Biochemical) 
or Quantum Yield Chemiluminescent Substrate (Promega) in place of the CDP-Star™ ; 
the membrane was then exposed to X-ray film as described above. The resulting 
autoradiograph is shown in Figure 3 1 . 

Figure 31 shows the results of incubation of the four substrates described above 
in the presence or absence of the Cleavase™ BN enzyme. Four sets of reactions are 
shown. Set one contains the reaction products from the incubation of the 157 
nucleotide sense strand fragment of the tyrosinase gene (SEQ ID NO:47) in the 
absence or presence of the Cleavase™ BN enzyme. Set two contains the reaction 
products from the incubation of the 157 nucleotide anti-sense strand fragment of the 
15 tyrosinase gene (SEQ ID NO:48) in the absence or presence of the Cleavase™ BN 

enzyme. Set three contains the reaction products from the incubation of the 165 base 
bottom strand fragment of the plasmid pGEM3Zf(+) (SEQ ID NO:49) in the absence 
or presence of the Cleavase™ BN enzyme. Set four contains the reaction products 
from the incubation of the 206 base top strand fragment of the plasmid pGEM3Zf(+) 
(SEQ ID NO:50) in the absence or presence of the Cleavase™ BN enzyme. Lanes 
marked "M" contain biotin-labeled molecular weight markers prepared as described 
above; the sizes of the marker fragments are indicated in Figure 31. In the absence of 
the Cleavase™ BN enzyme, no cleavage of the substrates is observed. In the presence 
of theCleavase™ BN enzyme , each substrate is cleaved generating a unique set of 
cleavage products. When these cleavage products are resolved on a polyacrylamide 
gel, a unique pattern or fingerprint is seen for each substrate DNA. Thus, although the 
four substrates are similar in size (157 to 206 bases), the Cleavase™ BN enzyme 
generates a unique collection of cleavage products from each substrate. These unique 
cleavage patterns result from the characteristic conformation each substrate DNA 
30 assumes. 
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The present invention contemplates the ability to generate a unique cleavage 
pattern for two or more DNA substrates of the same size as part of a method for the 
detection of genetic mutations. This method compares a normal (or wild type or non- 
mutated) substrate with a substrate from a patient suspected of having a mutation in 
5 that substrate. The two substrates would be of the same length and the cleavage 

reaction would be used to probe the patient DNA substrate for conformational changes 
relative to the pattern seen in the wild type control substrate. 

EXAMPLE 11 

Cleavage Directed By The Cleavase™ BN Enzyme 
10 Can Detect Single Base Changes In DNA Substrates 

The ability of the Cleavase™ BN enzyme to cleave DNA substrates of the 
same size but which contain single base changes between the substrates is herein 
demonstrated. The human tyrosinase gene was chosen as a model system because 
numerous single point mutations have been identified in exon 4 of this gene [Spritz, 
15 R.A. (1994) Human Molecular Genetics 3:1469]. Mutation of the tyrosinase gene leads 

to oculocutaneous albinism in humans. 

Three single-stranded substrate DNAs were prepared; the substrates contain a 
biotin label at their 5' end. The wild type substrate comprises the 157 nucleotide 
fragment from the sense strand of the human tyrosinase gene [(SEQ ID NO:47); 
20 Geibel, L.B., et al. (1991) Genomics 9:435]. Two mutation-containing substrates were 

used. The 419 substrate (SEQ ID NO:54) is derived from the tyrosinase mutant 
G419R which contains a glycine (GGA) to arginine (AGA) substitution; this mutant 
differs from the wild-type exon 4 fragment by a single base change at nucleotide 2675 
[King, R.A., et al. (1991) Mol. Biol. Med. 8:19]. The 422 substrate (SEQ ID NO:55) 
25 is derived from the tyrosinase mutant R422Q which contains an arginine (CGG) to 

glutamine (CAG) substitution; this mutant differs from the wild type exon 4 fragment 
by a single base change at nucleotide 2685 [Giebel, L.B., et al. (1991) J. Clin. Invest. 
87:1119]. 
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Single-stranded DNA containing a biotin label at the 5' end was generated for 
each substrate using asymmetric PCR as described in Example 10a with the exception 
that the single-stranded PCR products were recovered from the gel rather than the 

double-stranded products. 

5 The following primer pair was used to amplify each DNA (the 419 and 422 

mutations are located internally to the exon 4 fragment amplified by the primer pair 
thus the same primer pair can be used to amplify the wild type and two mutant 
templates). The primer listed as SEQ ID NO:42 (sense primer) contains a biotin label 
at the 5' end and was used in a 100-fold excess over the anti-sense primer of SEQ ID 

10 NO:43. 

To generate the single stranded substrates the following templates were used. 
Ten ng of supercoiled plasmid DNA was used as the target to generate the wild-type 
(plasmid pcTYR-NlTyr) or 422 mutant (plasmid pcTYR-A422) 157 nucleotide 
fragments. Five microliters of the gel purified 339 bp PCR fragment (SEQ ID NO:44) 
15 derived from genomic DNA homozygous for the 419 mutation (described in Example 

10a) was used as the target to generate the 157 nucleotide 419 mutant fragment (SEQ 
ID NO:54). 

For each target DNA, the asymmetric PCR comprised 100 pinoles of SEQ ID 
NO:42 and 1 pmole of SEQ ID NO:43, 50 uM each dNTP, 20 mM Tris-Cl, pH 8.3, 
20 1.5 mM MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET P-40. 

The reaction mixture (45 ul) was overlaid with two drops of light mineral oil and the 
tubes were heated to 95°C for 5 sec then cooled to 70°C. Tag polymerase was then 
added as 1.25 units of enzyme in 5 ul of 20 mM Tris-Cl, pH 8.3, 1.5 mM MgCl 2 , 50 
mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET P-40. The tubes were 
heated to 95°C for 45 sec, cooled to 50°C for 45 sec, heated to 72°C for 1 min 15 sec 
for 30 repetitions with a 5 min incubation at 72°C after the last repetition. The single 
stranded PCR products were gel purified, precipitated and resuspended in 40 ul of TE 
buffer as described above. 
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Cleavage reactions were performed as follows. Each substrate DNA (100 
fmoles) was placed in a 200 [il thin wall microcentrifuge tube (BioRad) in 5ul of IX 
CFL pTM buffer with 2 mM MnCl 2 . A tube containing 33 fmoles of template DNA in 
10 nl of IX CFLP™ buffer and 1 MnCl 2 was prepared for each template and served 

5 as the no enzyme (or uncut) control. The solution was overlaid with one drop of light 

mineral oil. Tubes were brought to 95°C for 5 seconds to denature the substrates and 
then the tubes were quickly cooled to 65°C. 

Cleavage reactions were started immediately by the addition of a diluted 
enzyme mixture comprising 1 ul of the Cleavase™ BN enzyme [250 ng/ul in IX 

10 dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 8.0, 50 mM KC1, 

10 u£/ml BSA)] in 5^1 of IX CFLP™ buffer without MnCl 2 . The enzyme solution 
was at room temperature before addition to the cleavage reaction. After 5 minutes at 
65°C, the reactions were stopped by the addition of 8ul of stop buffer. The samples 
were heated to 72°C for 2 minutes and 7 p.1 of each reaction were resolved by 

15 electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), with 7 M urea, in 

a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described in Example 10a. The DNA was transferred to the membrane 
and the membrane was dried, washed in 1.2X Sequenase IMAGES Blocking Buffer, 

20 treated with IX SAAP buffer and reacted with CDP-Star™ (Tropix) and exposed to 

X-ray film as described in Example 10a. The resulting autoradiograph is shown in 
Figure 32. 

In Figure 32, lanes marked "M" contain molecular weight markers prepared as 
described in Example 10. Lanes 1-3 contain the no enzyme control for the wild type 
25 (SEQ ID NO:47), the 419 mutant (SEQ ID NO:54) and the 422 mutant (SEQ ID 

NO: 5 5) substrates, respectively. Lane 4 contains the cleavage products from the wild 
type template. Lane 5 contains the cleavage products from the 419 mutant. Lane 6 
contains the cleavage products from the 422 mutant. 
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Figure 32 shows that a similar, but distinctly different, pattern of cleavage 
products is generated by digestion of the three template DNAs with the Cleavase™ BN 
enzyme. Note that in the digest of mutant 419, the bands below about 40 nucleotides 
are absent, when compared to wild-type, while in the digest of mutant 422 several new 
bands appear in the 53 nucleotide range. 

Although the three template DNAs differed in only one of the 157 nucleotides, 
a unique pattern of cleavage fragments was generated for each. Thus a single base 
change in a 157 nucleotide fragment gives rise to different secondary structures which 
are recognized by the Cleavase™ enzyme. 

EXAMPLE 12 

Single Base Changes In Large DNA 
Fragments Are Detected By The Cleavase™ BN Enzyme 

The previous example demonstrated that the 5' nuclease activity of the 
Cleavase™ BN enzyme could be used to detect single point mutations within a 157 

15 nucleotide DNA fragment. The ability of the enzyme Cleavase™ BN to detect single 

point mutations within larger DNA fragments is herein demonstrated. 

Increasingly larger fragments derived from the 422 tyrosinase mutant was 
compared to the same size fragments derived from the wild-type tyrosinase gene. Four 
sets of single-stranded substrates were utilized: 1) a 157 nucleotide template derived 

20 from the sense strand of exon 4 from the wild-type (SEQ ID NO:47) and 422 mutant 

(SEQ ID NO:55), 2) a 378 nucleotide fragment containing exons 4 and 5 from the 
wild-type (SEQ ID NO:56) and 422 mutant (SEQ ID NO:57), 3) a 1.059 kb fragment 
containing exons 1-4 from the wild-type (SEQ ID NO:58) and 422 mutant (SEQ ID 
NO:59) and 4) a 1.587 kb fragment containing exons 1-5 from the wild-type (SEQ ID 

25 NO:60) and 422 mutant (SEQ ID NO:61). The only difference between the wild type 

and 422 mutant templates is the G to A change in exon 4 regardless of the length of 
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the template used. The G to A point mutation is located 27, 27, 929 and 1237 
nucleotides from the labeled ends of the 157 base, 378 base, 1.059 kb and 1.6 kb 
substrate DNAs, respectively. 

a) Preparation Of The Substrate DNA 

5 A cDNA clone containing either the wild-type [pcTYR-NlTyr, Bouchard, B., et 

al. (1989) J. Exp. Med. 169:2029] or 422 mutant [pcTYR-A422, Giebel, L.B., et al. 
(1991) 87:1119] tyrosinase gene was utilized as the target DNA in PCRs to generate 
the above substrate DNAs. The primer pair consisting of SEQ ID NOS:42 and 43 
were used to generate a double stranded 157 bp DNA fragment from either the mutant 
10 of wild-type cDNA clone. The primer pah consisting of SEQ ID NO:42 and SEQ ID 

NO:62 was used to generate a double stranded 378 bp DNA fragment from either the 
wild-type or mutant cDNA clone. The primer pair consisting of SEQ ID NO:63 and 
SEQ ID NO:43 was used to generate a double stranded 1.059 kbp DNA fragment from 
either the wild-type or mutant cDNA clone. The primer pair consisting of SEQ ID 
15 NO:64 and SEQ ID NO:62 was used to generate a double stranded 1.587 kbp DNA 

fragment from either the wild-type or mutant cDNA clone. In each case the sense 
strand primer contained a biotin label at the 5' end. 

The PCR reactions were carried out as follows. One to two ng of plasmid 
DNA from the wild-type or 422 mutant was used as the target DNA in a 100 ul 
20 reaction containing 50 uM of each dNTP, 1 [xM of each primer in a given primer pair, 

20 mM Tris-Cl, pH 8.3, 1.5 mM MgCl 2 , 50 raM KC1, with 0.05% TWEEN 20 and 
0.05% NONIDET P-40. Tubes containing the above mixture were overlaid with three 
drops of light mineral oil and the tubes were heated to 94°C for 1 min, then cooled to 
70°C. Taq polymerase was then added as 2.5 units of enzyme in 5 ul of 20 mM Tris- 
25 CI, pH 8.3, 1.5 mM MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% 

NONIDET P-40. The tube was heated to 93°C for 45 sec, cooled to 52°C for 2 min, 
heated to 72°C for 1 min 45 sec for 35 repetitions, with a 5 min incubation at 72°C 
after the last repetition. 



- 156 



Following the PCR, excess primers were removed using a QIA QUICK-SPIN 
PCR Purification kit (Qiagen, Inc. Chatsworth, CA) following the manufacturer's 
instructions; the DNA was eluted in 50 ul of 10 mM Tris-HCl, pH 8.0, 1 mM EDTA. 
The sense strand of each of the double- stranded fragments from the wild-type and 422 
mutant gene were isolated as follows. Streptavidin-coated paramagnetic beads (Dynal 
M280 beads) [0.5 mg in 50 ul; pre-washed in 2X bind and wash (B&W) buffer (2 M 
NaCl, 10 mM Tris-HCl, pH 7.5, 1 mM EDTA, 0.1% TWEEN 20)] were added to each 
purified PCR product. The samples were incubated at room temperature for 15 
minutes with occasional shaking. The beads were removed from the supernatant by 
exposing the tube to a magnetic plate and the supernatant was discarded. The bead- 
DNA complexes were washed twice in 2X B&W buffer. One hundred microliters of 
0.1 M NaOH were added to the beads and the samples were incubated at room 
temperature for 15 minutes (for the 157, 378 bp DNAs); for DNA fragments larger 
than 1 kb, the beads were incubated at 47°C for 30 minutes. After incubation, the 
beads were washed twice with 2X B&W buffer. Finally, the bead-ssDNA complexes 
were resuspended in 50 ul 2X B&W buffer and stored at 4°C. 



b) Cleavage Reaction Conditions 

The cleavage reactions were performed directly on the single-stranded DNA- 
bead complexes. Five to 10 ul of DNA-bead complex (about 100 fmoles of DNA) 
were placed in a 200 ul microcentrifuge tube and washed once with 10 ul of sterile 
H 2 0. Seven and one half microliters of IX CFLP™ buffer with 1.3 mM MnCl 2 (to 
yield a final concentration of 1 mM) was then added to each tube. The reaction tubes 
were prewarmed to 65°C for 2 minutes and cleavage was initiated by the addition of 
2.5 ul of the Cleavase™ BN enzyme (10-50 ng in IX dilution buffer). The reaction 
was carried out at 65°C for 5 min. 

Immediately after this 5 min incubation, the beads were allowed to settle to the 
bottom of the tube and the supernatant was removed and discarded. Ten to forty 
microliters of stop buffer (95% formamide with 20 mM EDTA and 0,05% xylene 
cyanol and 0.05% bromophenol blue) was then added to the beads and the sample was 
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incubated at 90°C for 5-10 minutes. The formamide/EDTA solution releases the 
biotinylated DNA from the beads. The beads were allowed to settle to the bottom of 
the tube. The supernatant containing the cleavage products was collected. Two to 
eight microliters of the supernatant solution loaded onto 6% polyacrylamide gel (19:1 
cross-link), with 7 M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM 
EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described in Example 10a and allowed to transfer overnight. After 
transfer the membrane was dried, blocked, probed and washed as described in Example 
10a. The blot was reacted with CDP-Star™ (Tropix) and exposed to X-ray film as 
described in Example 10a. The resulting autoradiograph is shown in Figure 33. 

In Figure 33, lanes marked "M" contain molecular weight markers prepared as 
described in Example 10. Lanes 1, 3, 5 and 7 contain cleavage products using the 
157, 378, 1056 or 1587 nucleotide sense strand fragment from the wild-type tyrosinase 
gene, respectively. Lanes 2, 4, 6 and 8 contain cleavage products using the 157, 378, 
1056 or 1587 nucleotide sense strand fragment from the 422 mutant tyrosinase gene, 
respectively. 

As shown in Figure 33, the clear pattern of cleavages seen between the wild 
type and 422 mutant was not obscured when the single base change was located in 
longer DNA fragments. Thus, the cleavage reaction of the invention can be used to 
scan large fragments of DNA for mutations. Fragments greater than about 500 bp in 
length cannot be scanned using existing methodologies such as SSCP or DGGE 
analysis. 
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EXAMPLE 13 

The Cleavase™ Reaction Is Insensitive 
To Large Changes In Reaction Conditions 

The results shown above demonstrated that the Cleavase™ BN enzyme can be 
5 used to probe DNA templates in a structure-specific but sequence independent manner. 

These results demonstrated that the Cleavase™ BN enzyme could be used as an 
efficient way to recognize conformational changes in nucleic acids caused by sequence 
variations. This suggested that the 5' nuclease activity of the Cleavase™ BN enzyme 
could be used to develop a method to scan nucleic acid templates for sequence 
10 alterations relative to a wild-type template. The experiments below showed that this 

was the case. Furthermore it is demonstrated below that the method of the invention is 
relatively insensitive to large changes in conditions thereby making the method suitable 
for practice in clinical laboratories. 

First, the effect of varying the concentration of MnCl 2 on the cleavage reaction 
15 was determined. Second, the effect of different amounts of salt (KC1) on the cleavage 

pattern was examined. Third, a time course was performed to investigate when 
complete cleavage was obtained. Fourth, a temperature titration was performed to 
determine the effect of temperature variations on the cleavage pattern. Next, the 
enzyme was titrated to determine the effect of a 50-fold variation in enzyme 
20 concentration on the cleavage reaction. The results of these experiments showed that 

the Cleavase™ reaction is remarkably robust to large changes in conditions. 

a) MnCl 2 Titration 

To determine the sensitivity of the cleavage reaction to fluctuations in the 
concentration of MnCl 2 , a single template was incubated in the presence of a fixed 
25 amount of the Cleavase™ BN enzyme (250 ng) in a buffer containing 10 mM MOPS, 

pH 8.2 and various amount of MnCl 2 . The cleavage reaction was performed as 
follows. One hundred fmoles of the 157 nucleotide sense strand fragment of the 
tyrosinase gene (SEQ ID NO:55; prepared by asymmetric PCR as described in 

- 159 - 



1W 



10 



Example 11) was placed in a 200 ul thin wall microcentrifuge tube (BioRad) in 5 ul 
of IX CFLP™ buffer with 0, 2, 4, 8, 12 or 20 mM MnCl 2 (to yield a final 
concentration of either 0, 1, 2, 4, 6, 8 or 10 mM MnCl 2 ). A tube containing 100 
fmoles template DNA in 5|xl of IX CFLP™ buffer with 10 MnCl 2 was prepared and 
served as the no enzyme (or uncut) control. Each reaction mixture was overlaid with a 
drop of light mineral oil. The tubes were heated to 95°C for 5 sec and then cooled to 
65°C. 

Cleavage reactions were started immediately by the addition of a diluted 
enzyme mixture comprising 1 ul of the Cleavase™ BN enzyme [250 ng/ul in IX 
dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 8.0, 50 mM KCl, 
10 ug/ml BSA)] in 5ul of IX CFLP™ buffer without MnCl 2 . The enzyme solution 
was at room temperature before addition to the cleavage reaction. After 5 minutes at 
65°C, the reactions were stopped by the addition of 8 ul of stop buffer. Samples were 
heated to 72°C for 2 minutes and 8 ul of each reaction were resolved by 
electrophoresis through a 10% poly aery lamide gel (19:1 cross-link), with 7 M urea, in 
a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in 1.2X Sequenase IMAGES Blocking Buffer, 
treated with IX SAAP buffer and reacted with LUMIPHOS-530 (United States 
Biochemical) or Quantum Yield Chemiluminescent Substrate (Promega Corp., Madison 
WI) and exposed to X-ray film as described in Example 10. The resulting 
autoradiograph is shown in Figure 34. 

In Figure 34, lanes marked "M" contain molecular weight markers. Lane 1 
25 contains the no enzyme control and shows the migration of the uncleaved template 

DNA. Lanes 2 through 8 contain reaction products incubated in the presence of the 
Cleavase™ BN enzyme in a buffer containing 10, 8, 6, 4, 2, 1, or 0 mM MnCl 2 , respectively. 
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Figure 34 shows that no cleavage occurs in the absence of divalent cations 
(lane 8, 0 mM MnCl 2 ). Efficient production of cleavage fragments was promoted by 
the inclusion of MnCl 2 . The most distinct pattern of cleavage seen at 1 mM MnCl 2 
(lane 7), but little change in the pattern was seen when the Mn 2+ concentration varied 
from 1 to 4 mM; High concentrations of MnCl 2 tend to suppress the cleavage reaction 
(concentrations above 6 mM). These results show that the cleavage reaction requires a 
divalent cation but that changes in the amount of divalent cation present have little 
effect upon the cleavage pattern. 



b) Effect Of Salt Concentration On The Cleavage 
10 Reaction 

To determine the effect of salt concentration upon the cleavage reaction, a 
single template was incubated in the presence of a fixed amount of the Cleavase™ BN 
enzyme (250 ng) in a buffer containing 10 mM MOPS, pH 8.2, 1 mM MnCl 2 and 
various amount of KCl. 

15 One hundred fmoles of the 157 base fragment derived from the sense strand of 

exon 4 of the tyrosinase gene (SEQ ID NO:47; prepared as described in Example 10a) 
was placed in a 200 ul thin wall microcentrifuge tube (BioRad) in a buffer containing 
10 mM MOPS, pH 8.2 and 1 mM MnCl 2 . KCl was added to give a final 
concentration of either 0, 10, 20, 30, 40, or 50 mM KCl; the final reaction volume was 

20 10 ul. 

A tube containing 10 mM MOPS, pH 8.2, 1 mM MnCl 2 , 33 fmoles template 
DNA and 50 mM KCl was prepared and served as the no enzyme (or uncut) control. 
Each reaction mixture was overlaid with a drop of light mineral oil. The tubes were 
heated to 95°C for 5 seconds and then cooled to 65°C. 
25 Cleavage reactions were started immediately by the addition of a diluted 

enzyme mixture comprising 1 ^1 of the Cleavase™ BN enzyme [250 ng/ul in IX 
dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 8.0, 50 mM KCl, 
10 ng/ml BSA)] in 5^1 of IX CFLP™ buffer without MnCl 2 . The enzyme solution 
was at room temperature before addition to the cleavage reaction. After 5 minutes at 
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65°C, the reactions were stopped by the addition of 8|al of stop buffer. Samples were 
heated to 72°C for 2 minutes and 8 ul of each reaction were resolved by 
electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), with 7 M urea, in 
a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

5 After electrophoresis, the gel plates were separated and overlaid with a nylon 

membrane, as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in 1.2X Sequenase IMAGES Blocking Buffer, 
treated with IX SAAP buffer and reacted with LUMIPHOS-530 (United States 
Biochemical) or Quantum Yield Chemiluminescent Substrate (Promega Corp., Madison 

10 WI) and exposed to X-ray film as described in Example 10. The resulting 

autoradiograph is shown in Figure 35. 

In Figure 35 , lanes marked "M" contain molecular weight markers. Lane 1 
contains the no enzyme control and shows the migration of the uncleaved template 
DNA. Lanes 2 through 7 contain reaction products incubated in the presence of the 

15 Cleavase™ BN enzyme in a buffer containing 50, 40, 30, 20, 10 or 0 mM KC1, 

respectively. 

The results shown in Figure 35 show that the Cleavase™ reaction is relatively 
insensitive to variations in salt concentration. The same cleavage pattern was obtained 
when the 157 nucleotide tyrosinase DNA template (SEQ ID NO:47) was incubated 
20 with the enzyme Cleavase™ regardless of whether the KC1 concentration varied from 

0 to 50 mM. 

c) Time Course Of The Cleavage Reaction 

To determine how quickly the cleavage reaction is completed, a single template 
was incubated in the presence of a fixed amount of the Cleavase™ BN enzyme for 
25 various lengths of time. A master mix comprising 20 ul of a solution containing IX 

CFLP™ buffer, 2 mM MnCl 2 , and 400 fmoles of the 157 base fragment derived from 
the sense strand of exon 4 of the tyrosinase gene [(SEQ ID NO:47); prepared as 
described in Example 10b] was made. Five microliter aliquots were placed in 200 ul 
thin wall microcentrifuge tube (BioRad) for each time point examined. A no enzyme 
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control tube was run; this reaction contained 33 fmoles of the template DNA in IX 
CFLP™ buffer with 1 mM MnCl 2 (in a final reaction volume of lOul). The solutions 
were overlaid with one drop of light mineral oil. The tubes were brought to 95°C for 
5 seconds to denature the templates and then the tubes were cooled to 65°C. 

Cleavage reactions were started immediately by the addition of a diluted 
enzyme mixture comprising 1 ul of the Cleavase™ BN enzyme [250 ng per ul in IX 
dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 8.0, 50 mM KCl, 
10 ug/ml BSA)] in 5 p.1 of IX CFLP™ buffer without MnCl 2 . Immediately at the 
indicated time points, the reaction was stopped by the addition of 8 ul of 95% 
formamide containing 20 mM EDTA and 0.05% each xylene cyanol and bromophenol 
blue. The no enzyme control was incubated at 65°C, for 10 minutes and treated in the 
same manner as the other reactions by the addition of 8 ul of stop buffer. Samples 
were heated to 72°C for 2 minutes and 5 ul of each reaction were resolved by 
electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), with 7 M urea, in 
15 a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in 1.2X Sequenase IMAGES Blocking Buffer, 
treated with IX SAAP buffer and reacted with LUMIPHOS-530 (United States 
20 Biochemical) or Quantum Yield Chemiluminescent Substrate (Promega Corp., Madison 

WI) and exposed to X-ray film as described in Example 10b. The resulting 
autoradiograph is shown in Figure 36. 

In Figure 36, lanes marked "M" contain molecular weight markers prepared as 
described in Example 10. Lane 1 contains the no enzyme control incubated for 10 
25 minutes. Lanes 2-5 contain the cleavage products from reactions incubated for 0.1, 1, 

5 or 10 minutes at 65°C. Figure 36 shows that the cleavage reaction mediated by the 
Cleavase™ BN enzyme is very rapid. Cleavage is already apparent at less than 6 
seconds (<0.1 min) and is complete within one minute. These results also show that 
the same pattern of cleavage is produced whether the reaction is run for 1 or 10 
30 minutes. 
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d) Temperature Titration Of The Cleavase Reaction 

To determine the effect of temperature variation on the cleavage pattern, the 
157 base fragment derived from the sense strand of exon 4 of the tyrosinase gene 
(SEQ ID NO:47) was incubated in the presence of a fixed amount of the enzyme 
Cleavase™ BN for 5 minutes at various temperatures. One hundred fmoles of 
substrate DNA (prepared as described in Example 10b) was placed in a 200 ul thin 
wall microcentrifuge tube (BioRad) in 5 ul of IX CFLP™ buffer with 2 mM MnCl, 
Two "no enzyme" test control tubes were set-up as above with the exception that these 
reactions contained 33 fmoles of substrate DNA in 10 ul of the above buffer with 1 
mM MnCl 2 . The solution was overlaid with one drop of light mineral oil. Tubes were 
brought to 95°C for 5 seconds to denature the templates and then the tubes were 
cooled to the desired temperature. 

Cleavage reactions were started immediately by the addition of a diluted 
enzyme mixture comprising 1 ul of the Cleavase™ BN enzyme [250 ng per ul in IX 
dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tns-HCl, pH 8.0, 50 mM KC1, 
10 jig/ml BSA)] in 5ul of IX CFLP™ buffer without MnCl 2 . The tubes placed at 
either 55°, 60°, 65°, 70°, 75° or 80°C. After 5 minutes at a given temperature, the 
reactions were stopped by the addition of 8ul of stop buffer. 

Samples were heated to 72°C for 2 minutes and 5 ul of each reaction were 
20 resolved by electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), with 7 

M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in 1.2X Sequenase IMAGES Blocking Buffer, 
treated with IX SAAP buffer and reacted with LUMIPHOS-530 (United States 
Biochemical) or Quantum Yield Chemiluminescent Substrate (Promega Corp., Madison 
WI) and exposed to X-ray film as described in Example 10. The resulting 
autoradiograph is shown in Figure 37. 
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In Figure 37, the lanes marker "M" contain molecular weight markers prepared 
as described in Example 10. Lanes 1 and 2 contain no enzyme controls incubated at 
55°C and 80°C, respectively. Lanes 3-8 contain the cleavage products from the 
Cleavase™ enzyme-containing reactions incubated at 55°C, 60°C, 65°C, 70°C, 75°C or 

5 80°C, respectively. 

Figure 37 shows that the Cleavase™ reaction can be performed over a wide 
range of temperatures. The pattern of cleavages changed progressively in response to 
the temperature of incubation, in the range of 55°C to 75°C. Some bands were 
evident only upon incubation at higher temperatures. Presumably some structures 
10 responsible for cleavage at the intermediate temperatures were not favored at the lower 

temperatures. As expected, cleavages became progressively less abundant in the high 
end of the temperature range tested as structures were melted out. At 80°C cleavage 
was inhibited completely presumably due to complete denaturation of the template. 
These results show that the cleavage reaction can be performed over a wide 
15 range of temperatures. The ability to run the cleavage reaction at elevated 

temperatures is important. If a strong (i.e., stable) secondary structure is assumed by 
the templates, a single nucleotide change is unlikely to significantly alter that structure, 
or the cleavage pattern it produces. Elevated temperatures can be used to bring 
structures to the brink of instability, so that the effects of small changes in sequence 
20 are maximized, and revealed as alterations in the cleavage pattern. within the target 

template, thus allowing the cleavage reaction to occur at that point. 

e) Titration Of The Cleavase™ BN enzyme 

The effect of varying the concentration of the Cleavase™ BN enzyme in the 
cleavage reaction was examined. One hundred fmoles of the 157 base fragment 
25 derived from the sense strand of exon 4 of the tyrosinase gene (SEQ ID NO:47) was 

placed in 4 microcentrifuge tubes in 5 ul of IX CFLP™ buffer with 2 mM MnCl, A 
no enzyme control tube was run; this reaction contained 33 fmoles of substrate DNA 
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in 10 nl of IX CFLP™ buffer containing 1 mM MnCl 2 . The solutions were overlaid 
with one drop of light mineral oil. The tubes were brought to 95°C for 5 seconds to 
denature the templates and then the tubes were cooled to 65 °C. 

Cleavage reactions were started immediately by the addition of a diluted 
enzyme mixture comprising 1 ul of the Cleavase™ BN enzyme in IX dilution buffer 
such that 10, 50, 100 or 250 ng of enzyme was in the tubes in 5 ul of IX CFLP™ 
buffer without MnCl 2 . After 5 minutes at 65°C, the reactions were stopped by the 
addition of 8 ul of stop buffer. The samples were heated to 72°C for 2 minutes and 7 
Hi of each reaction were resolved by electrophoresis through a 10% poly aery lamide gel 
(19:1 cross-link), with 7 M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 
1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in 1.2X Sequenase IMAGES Blocking Buffer, 
treated with IX SAAP buffer and reacted with LUMIPHOS-530 (United States 
Biochemical) or Quantum Yield Chemiluminescent Substrate (Promega Corp., Madison 
WI) and exposed to X-ray film as described in Example 10. The resulting 
autoradiograph is shown in Figure 38. 

The lanes marked "M" in Figure 38 contain molecular weight markers. Lane 1 
20 contains the no enzyme control and shows the migration of the uncut substrate. Lanes 

2-5 contain reaction products from reactions containing 10, 50, 100 or 250 ng of the 
Cleavase™ BN enzyme, respectively. 

These results show that the same cleavage pattern was obtained using the 157 
nucleotide tyrosinase DNA substrate regardless of whether the amount of enzyme used 
25 in the reaction varied over a 25-fold range. Thus, the method is ideally suited for 

practice in clinical laboratories where reactions conditions are not as controlled as in 
research laboratories. 
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f) Consistent Cleavage Patterns Are Obtained Using 
Different DNA Preparations 

To demonstrate that the same cleavage pattern is consistently obtain from a 
given substrate, several different preparations of the 157 base fragment derived from 
the sense strand of exon 4 of the tyrosinase gene (SEQ ID NO:47) were generated. 
The substrate was generated as described in Example 10b. Three independent PCR 
reactions performed on separate days were conducted. One of these PCR samples was 
split into two and one aliquot was gel-purified on the day of generation while the other 
aliquot was stored at 4°C overnight and then gel-purified the next day. 

Cleavage reactions were performed as described in Example 10b. Samples 
were run on an acrylamide gel and processed as described in Example 10b. The 
resulting autoradiograph is shown in Figure 39. 

In Figure 39, the lanes marked "M" contain biotinylated molecular weight 
markers. Set 1 contains the products from a cleavage reaction performed in the 
absence (-) or presence (+) of enzyme on preparation no. 1. Set 2 contains the 
products from a cleavage reaction performed in the absence (-) or presence (+) of 
enzyme on preparation no. 2. Set 3 contains the products from a cleavage reaction 
performed in the absence (-) or presence (+) of enzyme on preparation no. 3. 
Preparation no. 3 was derived from preparation 2 and is identical except that 
preparation no. 3 was gel-purified one day after preparation no. 2. Set 4 contains the 
products from a cleavage reaction performed in the absence (-) or presence (+) of 
enzyme on preparation no. 4. The same pattern of cleavage products is generated 
from these independently prepared substrate samples. 

These results show that independently produced preparations of the 157 
nucleotide DNA fragment gave identical cleavage patterns. Thus, the Cleavase™ 
reaction is not effected by minor differences present between substrate preparations. 
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EXAMPLE 14 

Point Mutations Are Detected Using Either The 
Sense Or Anti-Sense Strand Of The Tyrosinase Gene 

The ability of the Cleavase™ enzyme to create a unique pattern of cleavage 
5 products (i.e., a fingerprint) using either the sense (coding) or anti-sense (non-coding) 

strand of a gene fragment was examined. 

Single stranded DNA substrates corresponding to either the sense (SEQ ID 
NO:47) or anti-sense strand (SEQ ID NO:48) of the 157 nucleotide fragment derived 
from the wild-type tyrosinase gene were prepared using asymmetric PCR as described 
10 in Example 10a. The sense strand wild-type substrate contains a biotin label at the 5' 

end; the anti-sense strand contains a fluorescein label at the 5' end. 

A single stranded DNA substrate corresponding to the sense strand of the 157 
nucleotide fragment derived from the 419 mutant tyrosinase gene (SEQ ID NO:54) 
was prepared using asymmetric PCR as described in Example 11. The sense strand 
15 419 mutant substrate contains a biotin label at the 5' end. 

A single stranded DNA substrate corresponding to the anti-sense strand of the 
157 nucleotide fragment derived from the 419 mutant tyrosinase gene (SEQ ID 
NO:65) was prepared using asymmetric PCR as described in Example 11 with the 
exception that 100 pmoles of the fluorescein-labeled anti-sense primer (SEQ ID 
20 NO:43) and 1 pmole of the biotin-labelled sense primer (SEQ ID NO:42) were used. 

The resulting anti-sense strand 419 mutant substrate contains a fluorescein label at the 
5' end. 

A single stranded DNA substrate corresponding to the sense strand of the 157 
nucleotide fragment derived from the 422 mutant tyrosinase gene (SEQ ID NO:55) 
25 was prepared using asymmetric PCR as described in Example 11. The sense strand 

422 mutant substrate contains a biotin label at the 5' end. 

A single stranded DNA substrate corresponding to the anti-sense strand of the 
157 nucleotide fragment derived from the 422 mutant tyrosinase gene (SEQ ID 
NO:66) was prepared using asymmetric PCR as described in Example 11 with the 
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exception that 100 pmoles of the fluorescein-labeled anti-sense primer (SEQ ID 
NO:43) and 1 pmole of the biotin-labelled sense primer (SEQ ID NO:42) were used. 
The resulting anti-sense strand 422 mutant substrate contains a fluorescein label at the 
5' end. 

5 Following asymmetric PCR, the single stranded PCR products were gel 

purified, precipitated and resuspended in 40 ul of TE buffer as described in Example 
10. 

Cleavage reactions were performed as described in Example 11. Following the 
cleavage reaction, the samples were resolved by electrophoresis as described in 

10 Example 10a. After electrophoresis, the gel plates were separated allowing the gel to 

remain flat on one plate. A 0.2 um-pore positively-charged nylon membrane 
(Schleicher and Schuell, Keene, NH), pre-wetted in 0.5X TBE (45 mM Tris-Borate, 
pH 8.3, 1.4 mM EDTA), was laid on top of the exposed acrylamide gel. All air 
bubbles trapped between the gel and the membrane were removed. Two pieces of 

15 3 MM filter paper (Whatman) were then placed on top of the membrane, the other 

glass plate was replaced, and the sandwich was clamped with binder clips. Transfer 
was allowed to proceed overnight. After transfer, the membrane was carefully peeled 
from the gel and allowed to air dry. After complete drying, the membrane was 
washed twice in 1.5X Sequenase IMAGES Blocking Buffer (United States 

20 Biochemical) for 30 minutes/wash. Three tenths of a ml of the buffer was used per 

cm 2 of membrane. The following reagents were added directly to the blocking 
solution: a streptavidin- alkaline phosphatase conjugate (SAAP, United States 
Biochemical) added at a 1:4000 final dilution and an anti- fluorescein antibody (Fab)- 
alkaline phosphatase conjugate (Boehringer Mannheim Biochemicals, Indianapolis, IN) 

25 added at a 1:20,000 final dilution. The membrane was agitated for 15 minutes. The 

membrane was rinsed briefly with H 2 0 and then washed 3 times (5 minutes/wash) in 
IX SAAP buffer (100 mM Tris-HCL, pH 10; 50 mM NaCl) with 0.05% SDS and 
0.025% TWEEN 20 using 0.5 ml buffer/cm 2 of the buffer, with brief H 2 0 rinses 
between each wash. The membrane was then washed once in IX SAAP buffer 
30 without SDS or TWEEN 20, drained thoroughly and placed in a plastic heat-sealable 
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bag. Using a sterile pipet tip, 0.05 ml/cm 2 of CDP-Star™ (Tropix, Bedford, MA) was 
added to the bag and distributed over the entire membrane for 5 minutes. The bag 
was drained of all excess liquid and air bubbles. The membrane was then exposed to 
X-ray film (Kodak XRP) for an initial 30 minutes. Exposure times were adjusted as 
5 necessary for resolution and clarity. The resulting autoradiograph is shown in Figure 

40. 

In Figure 40, lanes marked "M" contain biotmylated molecular weight markers 
prepared as described in Example 10. Lanes 1-6 contain biotinylated sense strand 
substrates from the wild-type, 419 and 422 mutant 157 nucleotide fragments. Lanes 1- 

10 3 contain no enzyme controls for the wild-type, 419 and 422 mutant fragments, 

respectively. Lanes 4-6 contain the reaction products from the incubation of the sense 
strand of the wild-type, 419 and 422 mutant fragments with the Cleavase™ BN 
enzyme, respectively. Lanes 7-12 contain fluoresceinated anti-sense strand substrates 
from the wild-type, 419 and 422 mutant 157 nucleotide fragments. Lanes 1-3 contain 

15 "no enzyme" controls for the wild-type, 419 and 422 mutant fragments, respectively. 

Lanes 4-6 contain the reaction products from the incubation of the anti-sense strand of 
the wild-type, 419 and 422 mutant fragments with the enzyme Cleavase™ BN, 
respectively. 

As expected, distinct but unique patterns of cleavage products are generated for 
20 the wild-type, 419 and 422 mutant fragments when either the sense or anti-sense 

fragment is utilized. The ability to use either the sense or anti-sense strand of a gene 
as the substrate is advantageous because under a given set of reaction conditions one 
of the two strands may produce a more desirable banding pattern (i.e., the cleavage 
products are spread out over the length of the gel rather than clustering at either end), 
or may have a mutation more favorably placed to create a significant structural shift. 
This could be more important in the analysis of long DNA substrates which contain 
mutations closer to one end or the other. Additionally, detection on both strands 
serves as a confirmation of a sequence change. 
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EXAMPLE 15 

Detection Of Mutations In The Human 
Beta-Globin Gene Using The Cleavase™ Enzyme 

The results shown in Examples 10-14 showed that the Cleavase™ reaction 
5 could be used to detect single base changes in fragments of the tyrosinase gene ranging 

from 157 nucleotides to 1.6 kb. To demonstrate that the Cleavase™ reaction is 
generally applicable for the detection of mutations, a second model system was 
examined. 

The human p-globin gene is known to be mutated in a number of 

10 hemoglobinopathies such as sickle cell anemia and p-thalassemia. These disorders 

generally involve small (1 to 4) nucleotide changes in the DNA sequence of the wild 
type p-globin gene [Orkin, S.H. and Kazazian, H.H., Jr. (1984) Annu. Rev. Genet. 
18:131 and Collins, F.S. and Weissman, S.M. (1984) Prog. Nucleic Acid Res. Mol. 
Biol. 31:315]. At least 47 different mutations in the p-globin gene have been 

15 identified which give rise to a P-thalassemia. 

Three P-globin mutants were compared to the wild type p-globin gene [Lawn, 
R.M., et al. (1980) Cell 21:647] using the Cleavase™ reaction. Mutant 1 contains a 
nonsense mutation in codon 39; the wild-type sequence at codon 39 is CAG; the 
mutant 1 sequence at this codon is TAG [Orkin, S.H. and Goff, S.C. (1981) J. Biol. 

20 Chem. 256:9782]. Mutant 2 contains a T to A substitution in codon 24 which results 

in improper splicing of the primary transcript [Goldsmith, M.E., et al. (1983) Proc. 
Natl. Acad. Sci. USA 80:2318]. Mutant 3 contains a deletion of two A residues in 
codon 8 which results in a shift in the reading frame; mutant 3 also contains a silent C 
to T substitution in codon 9 [Orkin, S.H. and Goff, S.C. (1981) J. Biol. Chem. 

25 256:9782]. 
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a) Preparation Of Wild Type And Mutant 0-Globin 
Gene Substrates 

Single stranded substrate DNA was prepared from the above wild type and 
mutant p-globin genes as follows. Bacteria harboring the appropriate plasmids were 
streaked onto antibiotic plates and grown overnight at 37°C (bacteria with the wild- 
type plasmid and the plasmid containing the mutant 3, were grown on tetracycline 
plates; bacteria with the plasmids containing the mutant 1 and mutant 2 sequences 
were grown on ampicillin plates). Colonies from the plates were then used to isolate 
plasmid DNAs using the WIZARD Minipreps DNA Purification System (Promega 
Corp., Madison, WI). The colonies were resuspended in 200 |il of "Cell Resuspension 
Buffer" from the kit. The DNA was extracted according to the manufacturers 
protocol. Final yields of approximately 2.5 >ig of each plasmid were obtained. 

A 536 (wild-type, mutants 1 and 2) or 534 (mutant 3) nucleotide fragment was 
amplified from each of the above plasmids in polymerase chain reactions comprising 
5 ng of plasmid DNA, 25 pmoles each of 5'-biotinylated KM29 primer (SEQ ID 
NO:67) and 5 '-fluorescein labeled RS42 primer (SEQ ID NO:68), 50 uM each dNTP 
and 1.25 units of Tag DNA Polymerase in 50 ul of 20 mM Tris-Cl, P H 8.3, 1.5 mM 
MgCl 2 , 50 mM KC1 with 0.05% each TWEEN-20 and NONIDET P-40. The reactions 
were overlaid with 2 drops of light mineral oil and were heated to 95°C for 30 
seconds, cooled to 55°C for 30 seconds, heated to 72°C for 60 seconds, for 35 
repetitions in a thermocycler (MJ Research, Watertown, MA). The products of these 
reactions were purified from the residual dNTPs and primers by use of a WIZARD 
PCR Cleanup kit (Promega Corp., Madison, WI), leaving the duplex DNA in 50 ul of 
10 mM Tris-CL, pH 8.0, 0.1 mM EDTA. 

To generate single stranded copies of these DNAs, the PCRs described above 
were repeated using 1 ul of the duplex PCR DNA as template, and omitting the RS42 
primer. The products of this asymmetric PCR were loaded directly on a 6% 
polyacrylamide gel (29:1 cross-link) in a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 
mM EDTA, alongside an aliquot of the original PCR DNA to identify the location of 
the double-strand DNA product. After electrophoretic separation, the DNAs were 
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visualized by staining with ethidium bromide and the single stranded DNAs, identified 
by altered mobility when compared to the duplex DNAs, were excised and eluted from 
the gel slices by passive diffusion overnight into a solution comprising 0.5 M NH 4 0Ac, 
0.1% SDS and 0.2 mM EDTA. The products were collected by ethanol precipitation 
5 and dissolved in 40 ul of 10 mM Tris-Cl, P H 8.0, 0.1 mM EDTA. 

The sequence of the 536 nucleotide fragment from the wild-type fi-globin gene 
is listed in SEQ ID NO:69. The sequence of the 534 nucleotide fragment from mutant 
3 is listed in SEQ ID NO:70. The sequence of the 536 nucleotide fragment from 
mutant 1 is listed in SEQ ID NO:71. The sequence of the 536 nucleotide fragment 
10 from mutant 2 is listed in SEQ ID NO:72. 

b) Optimization Of The Cleavage Reaction Using The 
Wild-Type Beta-Globin Substrate 

The optimal conditions (salt concentration, temperature) which produce an array 
of cleavage products having widely differing mobilities from the p-globin substrate 
1 5 were determined. Conditions which produce a cleavage pattern having the broadest 

spread array with the most uniform intensity between the bands were determined (the 
production of such an array of bands aids in the detection of differences seen between 
a wild-type and mutant substrate). This experiment involved running the cleavage 
reaction on the wild type p-globin substrate (SEQ ID NO:69) at several different 
20 temperatures in the presence of either no KC1 or 50 mM KC1. 

For each KCL concentration to be tested, 30 jil of a master mix containing 
DNA, CFLP™ buffer and salts was prepared. For the "0 mM KC1" reactions, the mix 
included approximately 500 fmoles of single-stranded, 5' biotinylated 536-mer PCR 
DNA from plasmid pHBGl in 30 jil of IX CFLP™ buffer (10 mM MOPS, pH 8.2) 
25 with 1.7 mM MnCl 2 (for 1 mM in the final reaction); the "50 mM KC1" mix included 

83.3 mM KC1 in addition to the above components. The mixes were distributed into 
labeled reaction tubes in 6 |al aliquots, and stored on ice until use. An enzyme 
dilution cocktail included 450 ng of the Cleavase™ BN enzyme in IX CFLP™ buffer 
without MnCl 2 - 
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Cleavage reactions were performed at 60°C, 65°C, 70°C and 75°C. For each 
temperature to be tested, a pair of tubes with and without KC1 were brought to 95°C 
for 5 seconds, then cooled to the selected temperature. The reactions were then started 
immediately by the addition of 4 ul of the enzyme cocktail. In the 75°C test, a 
5 duplicate pair of tubes was included, and these tubes received 4 ul of IX CFLP™ 

buffer without MnCl 2 in place of the enzyme addition. No oil overlay was used. All 
reactions proceeded for 5 minutes, and were stopped by the addition of 8 jal of stop 
buffer. Completed and yet-to-be- started reactions were stored on ice until all reactions 
had been performed. Samples were heated to 72°C for 2 minutes and 5 ul of each 
10 reaction was resolved by electrophoresis through a 6% polyacrylamide gel (19:1 cross- 

link), with 7 M urea, in a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 
After electrophoresis, the gel plates were separated allowing the gel to remain flat on 
one plate. A 0.2 um-pore positively-charged nylon membrane (NYTRAN, Schleicher 
and Schuell, Keene, NH), pre-wetted in H 2 0, was laid on top of the exposed gel. All 
15 air bubbles were removed. Two pieces of 3 MM filter paper (Whatman) were then 

placed on top of the membrane, the other glass plate was replaced, and the sandwich 
was clamped with binder clips. Transfer was allowed to proceed overnight. After 
transfer, the membrane was carefully peeled from the gel and allowed to air dry. 
After complete drying the membrane was washed in 1.2X Sequenase IMAGES 
20 Blocking Buffer (United States Biochemical) using 0.3 ml of buffer/cm 2 of membrane. 

The wash was performed for 30 minutes. A streptavidin-alkaline phosphatase 
conjugate (SAAP, United States Biochemical) was added to a 1:4000 dilution directly 
to the blocking solution, and agitated for 15 minutes. The membrane was rinsed 
briefly with H 2 0 and then washed three times for 5 minutes per wash using 0.5 ml/cm 2 
25 of IX SAAP buffer (100 mM Tris-HCl, pH 10, 50 mM NaCl) with 0.1% sodium 

dodecyl sulfate (SDS). The membrane was rinsed briefly with H 2 0 between each 
wash. The membrane was then washed once in IX SAAP/1 mM MgCl 2 without SDS, 
drained thoroughly and placed in a plastic heat-sealable bag. Using a sterile pipet, 5 
mis of either CSPD™ or CDP-Star™ (Tropix, Bedford, MA) chemiluminescent 
30 substrates for alkaline phosphatase were added to the bag and distributed over the 
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entire membrane for 2-3 minutes. The CSPD™-treated membranes were incubated at 
37°C for 30 minutes before an initial exposure to XRP X-ray film (Kodak) for 60 
minutes. CDP-Star™-treated membranes did not require preincubation, and initial 
exposures were for 10 minutes. Exposure times were adjusted as necessary for 
resolution and clarity. The results are shown in Figure 41. 

In Figure 41, the lane marked "M" contains molecular weight markers. 
Lanes 1-5 contain reaction products from reactions run in the absence of KC1. Lane 1 
contains the a reaction run without enzyme at 75°C. Lanes 2-5 contain reaction 
products from reactions run at 60°C, 65°C, 70°C and 75°C, respectively. Lanes 6-10 
contain reaction products from reactions run in the presence of 50 mM KC1. Lane 6 
contains the a reaction run without enzyme at 75°C. Lanes 7-10 contain reaction 
products from reactions run at 60°C, 65°C, 70°C and 75°C, respectively. 

In general, a preferred pattern of cleavage products was produced when the 
reaction included 50 mM KC1. As seen in Lanes 7-10, the reaction products are more 
15 widely spaced in the 50 mM KCL-containing reactions at every temperature tested as 

compared to the reactions run in the absence of KCL (lanes 2-5; more of the cleavage 
products are found clustered at the top of the gel near the uncut substrate). As seen in 
Lane 7 of Figure 41, cleavage reactions performed in 50 mM KC1 at 60°C produced a 
pattern of cleavage products in which the products are maximally spread out, 
20 particularly in the upper portion of the gel, when compared to other reaction condition 

patterns. 

From the results obtained in this experiment, the optimal cleavage conditions 
for the 536 nucleotide sense strand fragment derived from the wild-type (3-globin gene 
(SEQ ID NO:69) were determined to be IX CFLP™ buffer containing 1 mM MnCl 2 
25 and 50 mM KC1 at 60°C. 
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c) Optimization Of The Cleavage Reaction Using Two 
Mutant Beta-Globin Substrates 

From the results obtained above in a) and b), 60°C was chosen as the optimum 
temperature for the cleavage reaction when a p-globin substrate was to be used. When 
the wild-type substrate was utilized, running the cleavage reaction in the presence of 
50 mM KC1 generate the optimal pattern of cleavage products. The effect of varying 
the concentration of KC1 upon the cleavage pattern generated when both wild-type and 
mutant p-globin substrates were utilized was next examined to determine the optimal 
salt concentration to allow a comparison between the wild-type and mutant p-globin 
substrates. 

Single stranded substrates, 536 nucleotides in length, corresponding to mutant 1 
(SEQ ID NO:71) and mutant 2 (SEQ ID NO:72) mutations were prepared as described 
above in a). These two mutants each differ from the wild-type sequence by 1 
nucleotide; they differ from each other by 2 nucleotides. 

For each substrate tested, 39 ul of a master mix containing DNA, CFLP™ 
buffer and MnCl 2 was prepared. These mixes each included approximately 500 fmoles 
of single-stranded, 5' biotinylated 536 nucleotide substrate DNA, 39 ul of IX CFLP™ 
buffer containing 1.54 mM MnCl 2 (giving a final concentration of 1 mM MnCl 2 ). The 
mixes were distributed into labeled reaction tubes in 6.5 ul aliquots. Each aliquot then 
received 0.5 ul of 200 mM KC1 for each 10 mM final KC1 concentration (e.g., 2.0 ul 
added to the 40 mM reaction tube) and all volumes were brought to 9 ul with dH 2 0. 
No oil overlay was used. The reactions were brought to 95°C for 5 seconds, then 
cooled to 65°C. The reactions were then started immediately by the addition of 50 ng 
of the Cleavase™ BN enzyme in 1 ul of enzyme dilution buffer (20 mM Tris-HCl, pH 
8.0, 50 mM KC1, 0.5% NP40, 0.5% TWEEN 20, 10 ng/ml BSA). All reactions 
proceeded for 5 minutes, and were stopped by the addition of 8 ul of stop buffer. 
Samples were heated to 72°C for 2 minutes and 5 ul of each reaction was resolved by 
electrophoresis through a 6% polyacrylamide gel (19:1 cross-link), with 7 M urea, in a 
buffer of 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 
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After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described above. The DNA was transferred to the membrane and the 
membrane was treated as described above in b) and then exposed to X-ray film. The 
resulting autoradiograph is shown in Figure 42. 
5 In Figure 42, the lane marked "M" contains molecular weight markers. 

Lanes 1, 3, 5, 7, 9 and 11 contain reaction products from cleavage reactions using the 
mutant I substrate in the presence of 0, 10, 20, 30, 40 or 50 mM KCl, respectively. 
Lanes 2 4, 6, 8, 10 and 12 contain reaction products from cleavage reactions using the 
mutant 2 substrate in the presence of 0, 10, 20, 30, 40 or 50 mM KCl, respectively. 
10 Figure 42 shows that while the pattern of cleavage products generated from 

each mutant changes as the KCl concentration is increased, distinct patterns are 
generated from each mutant and differences in banding patterns are seen between the 
two mutants at every concentration of KCl tested. In the mid-salt ranges (10 to 20 
mM KCl), the larger cleavage bands disappear and smaller molecular weight bands 
15 appear (lanes 3-6). At higher salt concentrations (30 to 50 mM KCl), the larger 

molecular weight cleavage bands reappear with the cominant loss of the low molecular 
weight bands (lanes 7-12). Reaction conditions comprising the use of 50 mM KCl 
were chosen as optimal from the results show in Figure 42. Clear differences in the 
intensities of a band appearing about 200 nucleotides (see arrow in Figure 42) is seen 
20 between the two mutant substrates under these reaction conditions. 



d) The Cleavase™ Enzyme Generates Unique Cleavage 
Products From Wild-Type And Mutant Beta-Globin 
Substrates 

From the experiments performed above, the optimal reaction conditions when 
25 the wild-type or mutant (3-globin substrates were determined to be the use of 50 mM 

KCl and a temperature of 60°C. These conditions were then used to allow the 
comparison of the cleavage patterns generated when the wild-type substrate (SEQ ID 
NO:69) was compared to the mutant 1 (SEQ ID NO:71), mutant 2 (SEQ ID NO:72) 
and mutant 3 (SEQ ID NO:70) substrates. 



177 - 



10 



15 



Single-stranded substrate DNA, 534 or 536 nucleotides in length, was prepared 
for the wild-type, mutant 1, mutant 2 and mutant 3 p-globm genes as described above 
in a). Cleavage reactions were performed as follows. Reaction tubes were assembled 
winch contained approximately 100 fmoles of each DNA substrate in 9 ul of 1.1X 
CFLP™ buffer (IX final concentration) with 1.1 mM MnCl 2 (1 mM final 
concentration) and 55.6 mM KC1 (50 mM final concentration). A "no enzyme" or 
uncut control was set up for each substrate DNA. The uncut controls contained one 
third as much DNA ( approximately 33 fmoles) as did the enzyme-containing reactions 
to balance the signal seen on the autoradiograph. 

The tubes were heated to 95°C for 5 sec, cooled to 60°C and the reactions were 
started immediately by the addition of 1 ul of the Cleavase™ BN enzyme (50 ng per 
ul in IX dilution buffer). The uncut controls received 1 ul of IX dilution buffer. 

Reactions proceeded for 5 mm and then were stopped by the addition of 8 ul 
of stop buffer. The samples were heated to 72°C for 2 min and 5 p.1 of each reaction 
was resolved by electrophoresis through a 6% polyacrylamide gel (19:1 cross-link), 
with 7 M urea, in a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described above. The DNA was transferred to the membrane and the 
membrane was treated as described above in b) and then exposed to X-ray film. The 
20 resulting autoradiograph is shown in Figure 43. 

In Figure 43, two panels are shown. The first panel shows the reaction 
products from the above cleavage reactions; the uncut controls are shown in lanes 1-4 
and the cleavage products are shown in lanes 5-6. The second panel is a magnification 
of lanes 5-8 to better shown the different banding patterns seen between the substrate 
25 DNAs in the upper portion of the gel. 

In Figure 43, the lanes marked "M" contain biotinylated molecular weight 
markers prepared as described in Example 10. Lanes 1-4 contain the uncut controls 
for the wild-type, mutant 1, mutant 2 and mutant 3 P-globin substrates, respectively. 
Lanes 5-8 contain the cleavage products from the wild-type, mutant 1, mutant 2 and 
30 mutant 3 substrates, respectively. 
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From the results shown in Figure 43, the Cleavase™ BN enzyme generates a 
unique pattern of cleavage products from each p-globin substrate tested. Differences 
in banding patterns are seen between the wild-type and each mutant; different banding 
patterns are seen for each mutant allowing not only a discrimination of the mutant 
from the wild-type but also a discrimination of each mutant from the others. 

The results shown here for the p-globin gene and above for the tyrosinase gene 
demonstrate that the Cleavase™ reaction provides a powerful new tool for the 
detection of mutated genes. 

EXAMPLE 16 

Treatment Of RNA Substrates Generates Unique Cleavage Patterns 

The present invention contemplates 5' nuclease cleavage of single- or double- 
stranded DNA substrates to generate a unique pattern of bands characteristic of a given 
substrate. The ability of the 5' nuclease activity of the Cleavase™ BN enzyme to 
utilize RNA as the substrate nucleic acid was next demonstrated. This experiment 
showed that RNA can be utilized as a substrate for the generation of a cleavage pattern 
using appropriate conditions (Lowering of the pH to 6.5 from 8.2 to reduce 
manganese-mediated degradation of the RNA substrate). The experiment was 

performed as follows. 

An RNA transcript internally labelled with biotin was produced to serve as the 
substrate. The plasmid pGEM3Zf (Promega) was digested with EcoBl. EcoM cuts 
the plasmid once generating a linear template. An RNA transcript 64 nucleotides in 
length (SEQ ID NO:73) was generated by SP6 transcription of the linearized template 
using a RIBOPROBE GEMINI System kit from Promega, Corp.; the manufacturer's 
directions were followed with the exception that 25% of the UTP in the reaction was 
replaced with biotin-UTP (Boehringer Mannheim) to produce an internally labelled 
transcript. Following the transcription reaction (1 hour at 37°C), the DNA template 
was removed by treatment with RQ1 RNase-free DNAse (from the RIBOPROBE 
GEMINI kit and used according to the manufacturer's instructions) and the RNA was 
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collected and purified by precipitating the sample twice in the presence of 2 M 
NH 4 OAc and ethanol. The resulting RNA pellet was rinsed with 70% ethanol, air 
dried and resuspended in 40 ul of 10 mM Tns-HCl, P H 8.0 and 1 mM EDTA. 

Cleavage reactions contained lul of the above RNA substrate and 50 ng of the 
5 Cleavase™ BN enzyme in 10 ul of IX RNA-CFLP™ buffer (10 mM MOPS, P H 6.3) 

and 1 mM of either MgCl 2 or MnCl 2 . The reactions were assembled with all the 
components except the enzyme and were warmed to 45°C for 30 sec. Reactions were 
started by the addition of 50 ng of the Cleavase™ BN enzyme in 1 ul of dilution 
buffer (20 mM Tris-HCl, pH 8.0, 50 mM KC1, 0.5% NP40, 0.5% TWEEN 20, 10 
10 p-g/ml BSA). Reactions proceeded for 10 mm and were stopped by the addition of 8 

ul of stop buffer. The samples were heated to 72°C for 2 minutes and 5 ul of each 
reaction were resolved by electrophoresis through a 10% polyacrylamide gel (19:1 
cross-link), with 7 M urea, in a buffer containing 45 mM Tns-Borate, pH 8.3, 1.4 mM 
EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in 1.2X Sequenase IMAGES Blocking Buffer, 
treated with IX SAAP buffer and reacted with LUMIPHOS-530 (United States 
Biochemical) or Quantum Yield Chemiluminescent Substrate (Promega) and exposed 
to X-ray film as described in Example 10b. The resulting autoradiograph is shown in 
Figure 44. 

In Figure 44 , lanes marked "M" contain molecular weight markers. Lane 1 
contains the no enzyme control and shows the migration of the uncut substrate. 
Lanes 2 and 3 contain reaction products from the incubation of the RNA substrate in a 
25 buffer containing MgCl 2 in the presence or absence of the Cleavase™ BN enzyme, 

respectively. Lanes 4 and 5 contain reaction products from the incubation of the RNA 
substrate in a buffer containing MnCl 2 in the presence or absence of the Cleavase™ 
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BN enzyme, respectively. A pattern of cleavage products is seen when the enzyme is 
incubated with the RNA substrate in the presence of MnCl 2 , (lane 5). 

These results show that the Cleavase™ enzyme can be used to probe RNA 
substrates for changes in sequence (i.e., point mutations, deletions, substitutions). This 
capability enables the examination of genes which have very large introns (e.g., greater 
than 10 kb) interrupting the coding sequences. The spliced RNA transcript represents 
a simpler target for the scanning for mutations. In addition, the structural (,\e., 
folding) information gained by cleavage of the RNA would be useful in targeting of 
hybridization or ribozyme probes to unstructured regions of RNAs. Furthermore, 
because the cleavage reaction occurs so quickly, the Cleavase™ enzyme can be used to 
study various types of RNA folding and the kinetics with which this folding occurs. 
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EXAMPLE 17 

The 5' Nuclease Activity From Both Cleavase™ BN Enzyme And Tag Polymerase 
Generates Unique Cleavage Patterns Using Double-Stranded DNA Substrates 

The ability of both the Cleavase™ BN enzyme and Tag polymerase to generate 
cleavage patterns on single-stranded DNA templates was examined. The substrates 
utilized in this experiment were the 378 nucleotide fragment from either the wild-type 
(SEQ ID NO:56) or 422 mutant (SEQ ID NO:57) tyrosinase gene. These single- 
stranded substrates were generated as described in Example 12a. 

Cleavage reactions were performed as described in Example 12b with the 
exception that half of the reactions were cut with the Cleavase™ BN enzyme as 
described and a parallel set of reaction was cut with Tag polymerase. The Tag 
polymerase reactions contained 1.25 units of Tag polymerase in IX CFLP™ buffer. 
The reaction products were resolved by electrophoresis and the autoradiograph was 
generated as described in Example 12b. The autoradiograph is shown in Figure 45. 

In Figure 45, lanes marked "M" contain biotinylated molecular weight markers. 
Lanes 1 and 2 contain the wild-type or 422 mutant substrate cleaved with the 
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Cleavase™ BN enzyme, respectively. Lanes 3 and 4 contain the wild-type or 422 
mutant substrate cleaved with Tag polymerase, respectively. 

Figure 45 shows that both the Cleavase™ BN enzyme and Tag polymerase 
generate a characteristic set of cleavage bands for each substrate allowing the 
differentiation of the wild-type and 422 mutant substrates. The two enzyme produce 
similar but distinct arrays of bands for each template. 

These results show that the 5' nuclease of both the Cleavase™ BN enzyme and 
Tag polymerase are useful for practicing the cleavage reaction of the invention. 
Cleavage with Tag polymerase would find application when substrates are generated 
using the PCR and no intervening purification step xs employed other than the removal 
of excess nucleotides using alkaline phosphatase 



EXAMPLE 18 

Multiplex Cleavage Reactions 

The above Examples show that the cleavage reaction can be used to generate a 
15 characteristic set of cleavage products from single- stranded DNA and RNA substrates. 

The ability of the cleavage reaction to utilize double-stranded DNA templates was 
examined. For many applications, it would be ideal to run the cleavage reaction 
directly upon a double- stranded PCR product without the need to isolate a single- 
stranded substrate from the initial PCR. Additionally it would be advantageous to be 
20 able to analyze multiple substrates in the same reaction tube ("multiplex" reactions). 

Cleavage reactions were performed using a double-stranded template which was 
carried a 5' biotin label on the sense-strand and a 5' fluorescein label on the anti-sense 
strand. The double-stranded substrate was denatured prior to cleavage. The double- 
stranded substrate was cleaved using Tag polymerase. Tag polymerase was used in 
25 this experiment because it has a weaker duplex-dependent 5' to 3' exonuclease activity 

than does the Cleavase™ BN enzyme and thus Tag polymerase does not remove the 5' 
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end label from the re-natured DNA duplexes as efficiently as does the Cleavase™ BN 
enzyme; therefore less signal is lost in the reaction. 

The substrate utilized was a 157 bp fragment derived from either the wild-type 
(SEQ ID NO:47), 419 mutant (SEQ ID NO:54) or 422 mutant (SEQ ID NO:55) of the 
tyrosinase gene. The wild-type fragment was generated as described in Example 10a, 
the 419 mutant fragment was generated as described in Example 10a and the 422 
mutant fragment was generated as described in Example 11 using PCR. The sense 
strand primer (SEQ ID NO:42) contains a 5' biotin label and the anti-sense primer 
(SEQ ID NO:43) contains a 5' fluorescein label resulting in the generation of a 
double-stranded PCR product label on each strand with a different label. The PCR 
products were gel purified as described in Example 10a. 

The cleavage reactions were performed as follows. Reaction tubes were 
assembled with approximately 100 fmoles of the double-stranded DNA substrates in 
5 ul of IX CFLP™ buffer, 1 mM MnCl 2 . The solutions were overlaid with a drop of 
mineral oil. The tubes were heated to 95°C for 30 sec and 1 unit of Tag polymerase 
(Promega) was added. Uncut controls contained 33 fmoles of double- stranded DNA 
substrates in 5 ul of IX CFLP™ buffer, 1 mM MnCl, The reactions were cooled to 
65°C and incubated at this temperature for 15 minutes. The reactions were stopped by 
the addition of 8 ul of stop buffer. The samples were heated to 72°C for 2 min and 5 
ul of reaction were resolved by electrophoresis through a 10% polyacrylamide gel 
(19:1 cross-link), with 7 M urea in a buffer containing 45 mM Tris-Borate, pH 8.3, 
1 4 mM EDTA. The entire set of reactions was loaded in duplicate on the gel such 
that duplicate nylon membranes containing the full set of reactions were created. After 
transfer to a nylon membrane (performed as described in Example 10a), the membrane 
was cut in half; one half was probed using a streptavidin-alkaline phosphatase 
conjugate to visualize the biotinylated sense-strand products (as described in Example 
10a) The other half of the membrane was probed with an anti-fluorescein antibody- 
alkaline phosphatase conjugate to visualize the fluorescem-labelled anti-sense strand 
products (as described in Example 5). The blots were visualized using the 
chemiluminescent procedures described in Examples 10a and 5 for biotin-labeled or 
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fluorescein-labeled DNA, respectively. The autoradiographs are shown side-by side in 
Figure 46. 

In Figure 46, the lane labeled "Ml" contains biotinylated molecular weight 
markers prepared as described in Example 10, The lane labeled "M2" contains 
molecular weight markers generated by digestion of P UC19 with Mspl, followed by 
Klenow treatment to fill-in the ends. The blunt ends were then labeled with 
fluoresceinated dideoxynucleotides (Boehringer Mannherm) using terminal transferase 
(Promega). Lanes Ml and 1-6 were developed using the protocol for biotinylated 
DNA. Lanes 7-12 and M2 were developed using the protocol for fluorescein-labeled 
DNA. Note that in all lanes both strands of the substrate are present; only one strand 
is visualized in a given development protocol. 

In Figure 46, lanes 1-3 and 7-9 contain the "no enzyme" or uncut controls 
using the wild-type, 419 or 422 mutant substrates, respectively. Lanes 4-6 and 10-12 
contain cleavage products from the wild-type, 419 or 422 mutant substrates, 
respectively. Unique patterns of cleavage products are seen for each strand of each of 
the three substrates examined. Thus, a single reaction allowed the generation of a 
unique fingerprint from either the sense or anti-sense strand of each of the three 

tyrosinase fragments tested. 

The results shown in Figure 46 demonstrate that a cleavage pattern can be 
generated from a double- stranded DNA fragment by denaturing the fragment before 
performing the cleavage reaction. Note that in Figure 46 the cleavage patterns were 
generated in the course of a single round of heating to denature and cooling to cleave 
and that much of the substrate remains in an uncut form. This reaction would be 
amenable to performing multiple cycles of denaturation and cleavage in a 
thermocycler. Such cycling conditions would increase the signal intensity seen for the 
cleavage products. Substrates generated by the PCR performed in the standard PCR 
buffer (50 mM KC1, 10 mM Tns-Cl, P H 8.3, 1.5 mM MgCl 2 , 0.01% gelatin) can be 
treated to remove remaining dNTPs (e.g., addition of alkaline phosphatase) and to 
provide Mn 2+ . Under these conditions the cleavase reaction can be performed on both 
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strands of one or more products generated in that PCR. Such a protocol reduces 
sample preparation to a minimum resulting in a savings of both time and expense. 

The above example also demonstrates that two distinct substrates can be 
analyzed in a single reaction thereby allowing the "multiplexing" of the cleavage 
reaction. 
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EXAMPLE 19 

Optimization Of Manganese Ion Concentration 
For Cleavage Of Double Stranded DNA Substrates 

As discussed above, it may be desirable to run the cleavage reaction on double- 
stranded DNA substrates such restriction fragments or segments generated by balanced 
or symmetric PCR. The effect of varying the concentration of Mn 2+ in cleavage 
reactions using double- stranded DNA substrates was investigated. The results shown 
below demonstrate that the optimal concentration of Mn 2+ is lower when a double- 
stranded DNA substrate is employed in the cleavage reaction as compared to single- 

stranded DNA substrates. 

Two double-stranded (ds) DNA substrates, 157 bp in length, derived from the 
tyrosinase mutants 419 (SEQ ID NO:40) and 422 (SEQ ID NO:84) were prepared by 
PCR amplification of the exon 4 region of human tyrosinase gene as described above 
in Example 18. The sense strand of the 419 and 422 tyrosinase mutant substrates 
contained a biotin-labeled at the 5' end following the PCR. The PCR products were 
gel purified as described in Example 10a. 

The cleavage reactions were performed as follows. Reaction tubes were 
assembled with approximately 40 fmoles of the ds DNA substrates in 10 ul of water. 
The tubes were brought to 95°C for 10 seconds in a PTC-100™ Programmable 
Thermal Controller (MJ Research, Inc.) to denature the DNA. The cleavage reactions 
were started by the addition of 10 ul of 2X CFLP™ buffer (pH 8.2) containing 1 ul of 
the Cleavase™ BN enzyme (25 ng in IX dilution buffer) and different concentrations 
of MnCl 2 such that the final concentration of MnCl 2 in reaction mixture (20 ul final 
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volume) was e lt her 0.5 mM, 0.25 mM, 0.15 mM, 0.1 mM, 0.05 mM and 0 mM. After 
mixing, the samples were immediately cooled to 65°C and incubated at this 
temperature for 5 minutes. The reactions were terminated by placing the samples on 
ice and adding 10 ,1 of stop buffer. The samples were heated to 85°C for 30 sec and 
10 ,1 of each reaction were resolved by electrophoresis through a 10% polyacrylarmde 
gel (19:1 cross-link), with 7 M urea in a buffer containing 45 mM Tns-Borate, pH 8.3, 
1.4 mMEDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described in Example 10a. The DNA was transferred to the membrane 
and the membrane was dried, washed in IX Sequenase IMAGES Blocking Buffer 
(USB) treated with IX SAAP buffer and reacted with CDP-Star™ (Tropix) and 
exposed to X-ray film as described in Example 10a. The resulting autoradiograph is 
shown in Figure 47. 

In Figure 47, the lane marked "M" contains molecular weight markers prepared 
described in Example 10. Lanes 1-6 contain the cleavage products generated by 
cleavage of the 419 mutant and lanes 7-12 contain the cleavage products generated by 
cleavage of the 422 mutant. The reaction products generated by cleavage of the ds 
DNA substrates in IX CFLP™ buffer containing 0.5 mM (lanes 1,7); 0.25 mM (lanes 
2,8); 0.15 mM (lanes 3,9); 0.1 mM (lanes 4,10); 0.05 mM (lanes 5,11) and 0 mM 

20 MnCl 2 (lanes 6,12) are shown. 

The results shown in Figure 47 show no cleavage is seen in the absence of 
divalent cations as is also the case for cleavage of ss DNA substrates [see Example 
13(a) and Figure 34]. Optimal cleavage (i.e., production of the most distinct pattern of 
clevage fragments) of ds DNA substrates was seen in the presence of 0.25 mM 
MnCl 2 This optimum is considerably lower than that obtained using ss DNA 
substrates [Example 13 and Figure 34 show that cleavage of ss DNA substrates was 
optimal m 1 mM MnCl,]. Figure 47 shows that the efficiency of cleavage of ds DNA 
substrates decreases as the concentration of MnCl 2 is lowered; this effect is likely due 
to the lower efficiency of the enzyme in decreasing concentrations of MnCl 2 . 
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Figure 47 shows that the cleavage pattern for dsDNA substrates apparently 
disappears when high concentrations of MnCl 2 (0.5 mM, lanes 1 and 7) are employed 
in the cleavage reaction. This result is in contrast to the results obtained when 
cleavage reactions are performed on single-stranded DNA (ssDNA) substrates. 
5 Example 13(a) showed that efficient cleavage of ss DNA substrates were obtained in 1 

mM MnCl 2 and little change in the cleavage pattern was seen when the Mn 2+ 
concentration varied from 1 to 4 mM. 

The loss of the signal seen when ds DNA substrates are cleaved in buffers 
containing high concentrations of MnCl 2 may be explained as follows. The presence 
10 of high concentrations of divalent ions promotes the reanneallmg of the DNA strands 

of the ds substrate during the course of the cleavage reaction. The Cleavase™ BN 
enzyme can nibble ds DNA substrates from the 5' end (i.e., the enzyme removes short 
DNA fragments from the 5' end in an exonucleolytic manner; see Example 6). This 
nibbling results in the apparent removal of the label from the substrate DNA (as the 
15 DNA contains a 5' end label). Very short DNA fragments which contain the 5' end 

label are not visualized as they run out of the gel or are not efficiently transferred to 
the membrane. 

EXAMPLE 20 

Detection of Cleavage Patterns Can Be Automated 

The ability to detect the characteristic genetic fingerprint of a nucleic acid 
substrate generated by the cleavage reaction using fluorescently labelled substrates in 
conjunction with automated DNA sequencing instrumentation would facilitate the use 
of the CFLP™ method in both clinical and research applications. This example 
demonstrates that differently labelled isolates (two different dyes) can be cleaved in a 
25 single reaction tube and can be detected and analyzed using automated DNA 

sequencing instrumentation. 
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Double-stranded DNA substrates, which contained either the N-hydroxy 
succinimidyl ester JOE-NHS (JOE) or FAM-NHS (FAM) on the sense-strand, were 
generated using the PCR and primers labelled with fluorescent dyes. The anti-sense 
strand contained a biotin label. The substrates utilized in this experiment were the 157 
bp fragments from the wild-type (SEQ ID NO:40) and 422 mutant (SEQ ID NO:55) of 

exon 4 of the tyrosinase gene. 

The wild-type and 422 mutant tyrosinase gene substrates were amplified from 
cDNA plasmid clones containing either the wild-type [pcTYR-NlTyr, Bouchard, B., et 
al. (1989) J. Exp. Med. 169:2029] or the 422 mutant [pcTYR-A422, Giebel, L.B., et 
al. (1991) 87:1119] forms of the tyrosinase gene. Each double- stranded substrate was 
amplified and the 5' ends labelled with either a biotin moiety or a fluorescent dye by 
using the following primer pairs in the PCR. The anti-sense primer of SEQ ID NO:43 
containing a 5'-biotin moiety was obtained from Integrated DNA Technologies, Inc. 
(IDT, Coralville, IA). The biotinylated anti-sense primer was used to prime the 
synthesis of both the wild-type and 422 mutant substrates. The sense primer of SEQ 
ID NO:42 labelled with JOE was used to prime synthesis of the wild-type tyrosinase 
gene The sense primer of SEQ ID NO:42 labelled with FAM was used to prime 
synthesis of the 422 mutant tyrosinase gene. The JOE and FAM-labelled primers were 
obtained from Genset (Paris, France). 

The PCR reactions were carried out as follows. One to two nanograms of 
plasmid DNA from the wild-type or 422 mutant were used as the target DNA in a 100 
Hi reaction containing 50 uM of each dNTP, 1 uM of each primer in a given primer 
pair, 20 mM Tns-Cl, P H 8.3, 1.5 mM M g Cl 2 , 50 mM KC1, 0.05% TWEEN 20 and 
0.05% NONIDET P-40. Tubes containing the above mixture were overlaid with 70 *il 
Chill Out 14™ liquid wax (MJ Research, Watertown, MA). The tubes were heated to 
95°C for 1 min and then cooled to 70°C. Tag DNA polymerase (Perkin-Elmer) was 
then added as 2.5 units of enzyme in 5 ul of a solution containing 20 mM Tris-Cl, pH 
8.3, 1.5 mM MgCl 2 , 50 mM KC1, 0.05% TWEEN 20 and 0.05% NONIDET P-40. 
The tubes were heated to 95°C for 45 sec, cooled to 50°C for 45 sec, heated to 72°C 



- U 



for 1 min and 15 sec for 35 repetitions. Following the last repetition, the tubes were 

incubated at 72°C for 5 min. 

The PCR products were gel purified as follows. The products were resolved by 
electrophoresis through a 6% polyacrylamide gel (29:1 cross-link) in a buffer 
5 containing 45 mM Tns-Borate, P H 8.3, 1.4 mM EDTA. The DNA was visualized by 

ethidium bromide staining and the 157 bp fragments were excised from the gel. The 
DNA was eluted from the gel slices by passive diffusion overnight into a solution 
containing 0.5 M NH 4 OAc, 0.1% SDS and 0.1 M EDTA. The DNA was then 
precipitated with ethanol in the presence of 4 ug of glycogen carrier. The DNA was 

10 pelleted and resuspended in 30 ul of H 2 0. 

The cleavage reactions were performed as follows. Approximately 100 fmoles 
of each double-stranded DNA substrate (1-3 ul of each gel purified DNA) in a total 
volume of 6 ul in H 2 0 was placed in a 500 ul thin wall microcentrifuge tube (Perkm- 
Elmer). The tube was heated to 95°C for 10 seconds to denature the substrates and 
15 then the tube was quickly cooled to 50°C (this step allows the DNA to assume its 

unique secondary structure by allowing the formation of mtra-strand hydrogen bonds 
between complimentary bases). The cleavage reaction was started by adding 2 ul of 
50 mM MOPS <pH 7.2), 1 ul of 1 mM MnCl 2 and 1 ul of Cleavase™ BN enzyme (50 
ng/ul) The cleavage reaction was performed in a thermocycler (Perkin-Elmer DNA 
20 Thermal Cycler 480, Norwalk, CT) programmed to heat to 95°C for 10 seconds and 

then cooled immediately to 50°C. The reaction was then incubated at 50°C for 5 
minutes and stopped by the addition of 1 ul of 10 mM EDTA. 

Following the cleavage reaction, the sample was resolved by gel electrophoresis 
using an ABI 373A DNA Sequencer (Foster City, CA). Prior to loading, the sample 
25 was denatured by adding 5 ul of a solution containing 95% formamide and 10 mM 

EDTA and heating to 90°C for 2 minutes. Five microliters of the sample was resolved 
by electrophoresis through a 6% polyacrylamide gel (19:1 cross-link), with 6 M urea, 
in IX TBE buffer (89 mM Tris-Borate, pH 8.3, 2 mM EDTA). The gel was run at 30 
watts for 14 hours. Signals from four wavelength channels were collected using the 
30 Applied Biosystem Data Collection program on a Macintosh computer. The raw data 
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was analyzed with the BaseFinder program [Giddings, M, e, al (1993) Nucl. Ac.ds 
Res. 21:4530] whieh eorreets for the fluorescent spectrum overlap in the four channel 
signals and mobility shifts caused by the use of different dye labels. 

The results are shown in Figure 48. Figure 48 shows two traces representing 
the two channel signals for the wild-type and mutant samples. The w,ld-type sample, 
which was labeled with JOE dye, is shown by the thin lines. The mutant sample 
(R422Q) which was labeled with FAM dye, is shown by the thick lines. For 
comparison, a photograph of a high resolution polyacrylam.de gel (10% gel with 19:1 
crosslink) containing the reso,ved cleavage products is shown above the traces (the top 
,a„e contains cleavage fragments produced by clevage of the wild-type substrate; the 
bottom lane contains cleavage fragments produced by clevage of the R422Q mutant 
substrate). The cleavage products shown in the gel, whieh contain biotin labels at the 
5- end of the sense strand, were generated, transferred to a nylon membrane and 
visualized as described in Example 10a. Arrows point from selected bands seen upon 
cleavage of the 422 mutant substrate to the corresponding peaks in the trace generated 
by the automated DNA sequencer (the arrows are labelled 1 through 7 beginning w.th 

the left-hand side of Figure 48). 

Comparison of the two traces shows that differences in the cleavage patterns 
generated from the cleavage of the wild-type and 422 mutant substrates in the same 
reaction are detected using automated DNA sequencing instrumentation. For example, 
cleavage of the 422 substrate generates a cleavage product of approximately 56 
nucleotides which is not seen when the wild-type substrate is cleaved. This 56 
nucleotide product is seen as the peak depicted by arrow 6 in Figure 48. Figure 48 
shows that not only are new cleavage products generated by cleavage of the mutant 
substrate, but that the cleavage of certain structures is enhanced in the mutant substrate 
as compared to the wild-type substrate (compare the intensity of the peaks 
corresponding to arrows 2-5 in the wild-type and mutant traces). In addition, certain 
cleavage products are shared between the two substrates and serve as reference 
markers (see arrows 1 and 7). 
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The above results show that automated DNA sequencing instrumentation can be 
used to detect the characteristic genetic fingerprint of a nucleic acid substrate generated 
by the cleavage reaction. The results also demonstrate that the cleavage reaction can 
be run as a multiplex reaction. In this experiment both the wild-type and the mutant 
ds DNA substrates were cleaved in the same reaction (i.e., a multiplex reaction) and 
then were resolved on the same electrophoretic run using an automated DNA 
sequencer. 

EXAMPLE 21 

Identification of Viral Strains Using the Cleavase™ Reaction 

The above examples demonstrate that the Cleavase™ reaction could be used to 
detect single base changes in fragments of varying s.ze from the human p-globm and 
tyrosinase genes. These examples showed the utihty of the Cleavase™ reaction for the 
detection and characterization of mutations in the human population. The ability of 
the Cleavase™ reaction to detect sequence variations characteristic of different stratus 

of a virus was next examined. 

The simian immunodeficiency virus (SIV) infection of monkeys is a widely 
used animal model for the transmtssion of human immunodeficiency virus type-1 
(HIV) in humans. Biological isolates of SIV contain multiple virus strains. When a 
monkey is infected with a biological isolate of SIV, unique subsets of the virus stock 
are recovered from the infected animals (specific strains are also able to infect tissue 
culture cells). Different genotypes of the virus are isolated from infected annuals 
depending on the route of infection [Trivedi, P. e, al Journal of Virology 68:7649 
(1994)] The SIV long terminal repeat (LTR) contains sequences which vary between 
the different viral strains and can be used as a marker for the identification of the v.ral 
genotype. 

In order to develop a rapid method for the identification of viral stra.n(s) m a 
sample («*, a clinical isolate), the Cleavase™ reaction was used to characterized SIV 
genotypes isolated after infection of cultured cells in vitro or after infection of rhesus 
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monkeys by either intravenous or intrarectal inoculation with uncloned biological SIV 
stocks . Six clones generated from viral DNA isolated following in vitro infection of 
the CEMxl74 cell line (L.CEM/251/12 clone), after intravenous inoculation of 
monkeys (L100.8-1 clone), after intrarectal low-dose inoculation of monkeys (L46.16- 

5 10 and L46.16-12 clones) and after intrarectal high-dose inoculation of monkeys 

(L19.16-3 and L36.8-3 clones) were obtained from C. David Pauza (Wisconsin 
Primate Research Center, Madison, WI). These clones were generated as described by 
Trivedi, P. et al. Journal of Virology 68:7649 (1994). These plasmid clones contained 
viral LTR sequences and were utilized to generate double-stranded DNA (ds DNA) 

10 substrates for the cleavage reaction. 

a) Preparation Of The Substrate DNA 

The six SIV plasmids were utilized as templates in PCRs in order to generate 
dsDNA substrates for the cleavage reaction. The primer pair utilized spans the U3-R 
boundary in the SIV LTR and amplifies an approximately 350 bp fragment. This 
15 portion of the SIV LTR contains recognition sequences for transcription factors 

(including Spl and NF-kB) as well as the TATA box for transcription initiation and is 
polymorphic in different viral strains [Trivedi, P. et al, supra]. 

The primer pair consisting of SEQ ID NOS:74 and 75 was used to amplify the 
SIV LTR clones in the PCR. SEQ ID NO:74 primes synthesis of the (+) strand of the 
20 SIV LTR and comprises 5 '-GGCTGACAAGAAGGAAACTC-3 ' . SEQ ID NO:75 

primes synthesis of the (-) strand of the SIV LTR and comprises 5'-CCAGGCGGCG 
GCT AGGAGAGATGGG-3 ' . To visualize the cleavage pattern generated by cleavage 
of the (+) strand of the LTR, the PCR was performed using the primer consisting of 
SEQ ID NO: 74 containing a biotin label at 5' end and unlabeled primer consisting of 
25 SEQ ID NO:75. To visualize the cleavage pattern generated by clevage of the (-) 

strand of the viral LTR, the PCR was performed using the primer pair consisting of 
SEQ ID NO:75 containing a biotin label at the 5' end and unlabeled primer SEQ ID 
NO:74. 
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The PCR reactions were carried out as follows. Ten to twenty nanograms of 
plasmid DNA from each of the above 6 SIV LTR clones was used as the target DNA 
in separate 100 ul reactions containing 60 uM of each dNTP, 0.2 uM of each primer 
in a given primer pair, 10 mM Tris-Cl, P H 9.0 (at 25°C), 2 mM MgCl 2 , 50 mM KC1, 
with 0.1% Triton X-100. Tubes containing the above mixture were overlaid with two 
drops of light mineral oil and the tubes were heated to 96°C for 3 mm and Tag DNA 
polymerase (Perkin-Elmer) was then added as 2.5 units of enzyme in 0.5 ul of 20 mM 
Tris-HCl, pH 8.0, 100 mM KC1, 0.1 mM EDTA, 1 mM DTT, 50% glycerol and 0.5% 
TWEEN 20 and 0.5% NONIDET P-40. The tubes were heated to 96°C for 45 
seconds, cooled to 60°C for 45 seconds, heated to 72°C for 1 minute for 35 
repetitions. Following the PCR, the reaction mixture was separated from the mineral 
oil and 5 ul of 5M NaCl, 4 ul of 10 mg/ml glycogen and 250 ul of 100% ethanol 
were added to the aqueous PCR samples. After incubation at -20°C for 1 hour, the 
DNA was pelleted by centnfugation in a MARATHON MICRO A centrifuge (Fisher 
Scientific) at maximum speed for 5 minutes and resuspended in 40 ul of 10 mM Tris- 
HCl, pH 8.0, 0.1 mM EDTA. 

The PCR products were gel purified as follows. The DNA was mixed with 0.5 
volume of loading buffer (95% formamide, 5mM EDTA, 0.02% bromphenol blue, 
0.02% xylene cyanol) and heated to 75°C for 2 minutes. The products were resolved 
by electrophoresis through a 6% polyacrylamide denaturing gel (19:1 cross-link) in a 
buffer containing 7M urea, 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. The DNA 
was visualized by ethidium bromide staining and the product bands were excised from 
the gel. The DNA was eluted from the gel slices by passive diffusion overnight into a 
solution containing 0.5 M NH 4 OAc, 0.1% SDS and 0.1 M EDTA. The DNA was then 
25 precipitated with ethanol in the presence of 4 u£ of tRNA carrier. The DNA was 

pelleted and resuspended in 50 ul of 0.2 M NaCl, 10 mM Tris-HCl, P H8.0, 0.1 mM 
EDTA. The DNA was precipitated with ethanol and resuspended in 50 ul of 10 mM 
Tris-HCl, pH 8.0, 0.1 mM EDTA. The final DNA concentration was estimated to be 
40 fmole/ul for each double-stranded SIV LTR PCR product. 
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b) DNA Sequence Analysis Of The SIV LTR PCR 
Products 

The DNA sequence of the six PCR fragments generated in section a) above was 
determined using the fmoF™ DNA Sequencing System (Promega) according to the 
5 manufacturer's instructions. For each set of the sequencing reactions 0.2 pmoles of the 

PCR product and 2 pmoles of one of the two 5'-biotinylated PCR primers SEQ ID 
NOS:74 and 75 was used (i.e., both strands of the PCR fragments were sequenced). 
Following the sequencing reactions, the sequencing products were resolved by 
electrophoresis. After electrophoresis, the DNA bands were visualized following 
10 transfer to a nylon membrane as described in Example 19 with the following 

modification. A solution containing 0.2% Blocking reagent (Boehringer-Mannheim) 
and 0.2% SDS in TBS buffer (100 mM Tns-HCl, pH7.4; 68 mM NaCl) was used in 
place of the IX Sequenase IMAGES Blocking Buffer (USB). 

The sequence of the 351 bp fragment derived from the L100.8-1 LTR clone is 
15 listed in SEQ ID NO:76. The sequence of the 340 bp fragment from the L46.16-10 

LTR clone is listed in SEQ ID NO:77. The sequence of the 340 bp fragment derived 
from the L46.16-12 LTR clone is listed in SEQ ID NO:78. The sequence of the 351 
bp fragment from the L19.16-3 LTR clone is listed in SEQ ID NO:79. The sequence 
of the 351 bp fragment derived from the LCEM/251/12 LTR clone is listed in SEQ ID 
20 NO:80. The sequence of the 351 bp fragment derived from the L36.8-3 LTR clone is 

listed in SEQ ID NO:81. 

Analysis of sequenced LTR fragments shows that they have multiple 
substitutions and a deletion relative to the L100.8-1 LTR sequence (SEQ ID NO:76); 
the L100.8-1 LTR sequence was chosen as a reference to permit comparisons between 
25 the six LTR clones. For the ease of discussion, the first or 5'-terminal nucleotide of 

the (+) strand of L100.8-1 LTR sequence (SEQ ID NO:76) is defined as number 1 
and the last or 3 '-terminal nucleotide of this sequence is defined as number 351. 

Figure 49 displays the nucleotide sequence of the six LTR clones. The 
reference clone, L.100.8-1 (SEQ ID NO:76), is shown on the top line. Sequences 
30 appearing in bold type represent sequence changes relative to the sequence of clone 
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L. 100.8-1 (SEQ ID NO:76). The sequences outlined by the brackets in Figure 49 
represent palindromic sequences which can form a very stable hairpin structure having 
a stem of 14 base pairs (12/14 bases in the stem are complementary) and a loop of 7 
nucleotides in the reference clone L.100.8-1 (SEQ ID NO:76). This hairpin structure 
is present in all six LTR clones although the sequence of the stem and loop structures 

varies between the clones. 

In comparison with L100.8-1 sequence (SEQ ID NO:76), the L46.16-10 
sequence (SEQ ID NO:77) has seven substitutions and one 11 nucleotide deletion 
corresponding to nucleotides 65-75 of SEQ ID NO:76. The substitutions are: C to T 
in position 28 (C28T), C57T, G90A, C97T, G238A, G242A and G313A. The L46.16- 
12 sequence (SEQ ID NO:78) has seven substitutions and one 11 nucleotide deletion 
corresponding to nucleotides 65-75 of SEQ ID NO:76. The substitutions are: C28T, 
C57T, G90A, C97T, A103G, G242A and G313A. L19. 16-3 sequence (SEQ ID 
NO:79) has two substitutions: A94C and A317T. LCEM/251/12 sequence (SEQ ID 
NO:80) has seven substitutions: G26A, G72A, C97T, G258A, A281C, G313A and 
C316T. L36.8-3 sequence (SEQ ID NO:81) has six substitutions: G60A, G172A, 
G207A, G221A, T256C and C316T. 
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c) Cleavage Reaction Conditions And CFLP™ Analysis 
Of The (-) Strand Of The SIV LTR 

Double-stranded substrates corresponding to the SIV LTR sequences listed in 
SEQ ID NOS:76-81 were labelled on the (-) strand using the PCR and the primer pair 
corresponding to SEQ ID NO: 74 and 75. The primer of SEQ ID NO:75 [the (-) 
strand primer]contained a biotin label at the 5' end. The PCR was performed and the 
reaction products were isolated as described in section a). 

The cleavage reactions were performed as follows. Reaction tubes were 
assembled with approximately 60 fmoles of the ds DNA substrates in 6 ul of water. 
The following reagents were added to the DNA: 2 ul of 5X CFLP™ buffer (pH 7.2) 
containing 150 mM KC1 (to yield a final concentration of 30 mM KC1) and 1 ul of the 
Cleavase™ BN enzyme (25 ng in IX dilution buffer). A reaction tube containing the 
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above components with the exception that 1 ul of H 2 0 was added in place of the 
Cleavase™ BN enzyme was prepared and run as the uncut or no enzyme control. The 
tubes were brought to 95°C for 10 seconds in a PTC-100™ Programmable Thermal 
Controller (MJ Research, Inc.) to denature the DNA. Following the denaturation step, 
5 the tubes were immediately cooled to 40°C. The cleavage reaction was immediately 

started by the addition of 1 ul of 2 mM MnCl 2 (to achieve a final concentration of 0.2 
mM). The tubes were incubated at 40°C for 5 minutes. The reactions were 
terminated by adding 6 ul of stop buffer. The samples were heated to 85°C for 30 sec 
and 5 |xl of each reaction were resolved by electrophoresis through a 12% 
10 polyacrylamide gel (19:1 cross-link), with 7 M urea in a buffer containing 45 mM 

Tris-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described in Example 10a. The DNA was transferred to the membrane 
and the membrane was dried, washed in 0.2% Blocking reagent (Boehringer 
15 Mannheim); 0.2% SDS in 100 mM Tris-HCl, pH 7.4; 68 mM NaCl, treated with IX 

SAAP buffer and reacted with CDP-Star™ (Tropix) and exposed to X-ray film as 
described in Example 10a. The resulting autoradiograph is shown in Figure 50. 

Figure 50 shows the cleavage patterns which correspond to the cleavage of the 
(-) strand of the double-stranded LTR substrates. In Figure 50, the lane marked "M" 
20 contains molecular weight markers (prepared as described in Example 10). Lanes 1-6 

contain the cleavage products generated by cleavage of the L100.8-1, L46.16-10, 
L46.16-12, L19.16-3, LCEM/251/12 and L36.8-3 LTR PCR fragments, respectively. 
Lanes 7-12 contain the uncut controls of each of the 6 LTR substrates in the same 
order as that described for Lanes 1-6. 
25 The results shown in Figure 50 show that the cleavage or CFLP™ pattern for 

each LTR substrate contains multiple bands which range in size from approximately 
350 nucleotides (the uncut substrate) to less than 24 nucleotides. The bands located 
below about 100 nucleotides in length show differences between the six clones which 
reflect nucleotide changes characteristic of the different SIV LTR isolates. 
30 Examination of the CFLP™ patterns revealed that the reaction detected five unique 
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cleavage patterns among the six SIV LTR isolates. From the DNA sequenee data, it 
was known that all six LTR clones were unique. However, the CFLP™ pattern 
appeared to be identical for the clones shown in lanes 2 and 3. 

The CFLP™ patterns generated by cleavage of the (-) strand from all stx 
substrates contatn a strong band which corresponds to a fragment of approximately 100 
nucleotides in .ength. This band corresponds to cleavage of all six LTR substrates at 
the long palindromic sequence located 97 nucleotides from the 5' end of the (-) strand 
(see the bracketed region in Figure 49). This palindromic sequence forms a very 
stable hairpin structure in single-stranded DNA and provtdes an optimal substrate for 
the Cleavase™ BN enzyme. Cleavage of this hairpin structure is predated to generate 
a fragment of approximately 100 nucleotides. 

The LTR substrates, L46.16-10 (SEQ ID NO:77) and L46.16-12 (SEQ ID 
N0 78) shown in lanes 2 and 3 were generated from the same ammal using the same 
route of infection [Trivedi, P. « aL, supra]. These substrates have identical sequences 
5 in the region corresponding to the detectable cleavage sites (i.e., below 100 

nucleotides) with the exception of a single base; the L46.16-10 clone (SEQ ID NO:77) 
contains a G to A change at position 239 (G239A) re.ative to the reference sequence 
listed in SEQ ID NO:76 . Examination of the DNA sequence of these two clones 
reveals that this substitution ts located in the loop region of a strong hairpin structure 
0 (see the palindromic region bracketed in Figure 49). Because the single base 

dtfference between these two sequences is located in the loop region of the hatrptn 
structure, it may not change DNA secondary structure of the two substrates suffictently 
to generate different CFLP™ patterns under the condttions utilized here. It may be 
possible to detect this single base difference between these two clones by varytng the 
25 reaction conditions in a way that destablizes the strong hairpin structure. 

The results shown in Figure 50 demonstrate that the CFLP™ reaction can be 
used to detect the majority (5/6 or 83%) of the sequence variations present in the stx 
SIV LTR clones studied. In addition, Figure 50 demonstrates that the CFLP™ 
reaction is a sensitive means for probing the secondary structure of single strands of 
30 nucleic acids. 
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d) Cleavage Reaction Conditions And CFLP™ Analysis 
Of The (+) Strand Of The SIV LTR 

Double-stranded substrates corresponding to the SIV LTR sequences listed in 
SEQ ID NOS:76-81 were labelled on the (+) strand using the PCR and the primer pair 
corresponding to SEQ ID NO: 74 and 75. The primer of SEQ ID NO:74 [the ( + ) 
strand primer] contained a biotin label at the 5' end. The PCR was performed and the 
reaction products were isolated as described in section a). The cleavage reactions, 
electrophoresis and DNA visualization were performed as described above in section 
c). The resulting autoradiograph is shown in Figure 51. 

Figure 51 shows the resulting pattern corresponding to the cleavage products of 
the (+) strand of six SIV LTR fragments. The lane marked "M" contains molecular 
weight markers (prepared as described in Example 10). Lanes 1-6 contain the 
cleavage products generated by cleavage of the L100.8-1, L46.16-10, L46.16-12, 
L19.16-3, LCEM/251/12 and L36.8-3 LTR PCR fragments, respectively. Lanes 7-12 
contain the uncut controls of each of the 6 LTR substrates in the same order as that 

described for Lanes 1-6. 

As was shown for cleavage of the (-) strand of the LTR clones, the CFLP™ 
pattern for each (+) strand of the SIV LTR substrates contains unique features that 
characterize the majority of specific nucleotide substitutions. For example, deletion of 

11 nucleotides can be easily detected for L46.16-10 (SEQ ID NO:77) and L46.16-12 
(SEQ ID NO:78) (Figure 51, lanes 2 and 3). This deletion removes one of the three 
Spl binding sites and is a change characteristic of the genotype of SIV which 
predominates in animals which are infected using low-doses of virus stock via 
intrarectal inoculation [Trivedi, P. et al, supra]. The CFLP™ pattern generated by 
cleavage of the (+) strand of the substrates derived from clones L46.16-10 and L46.16- 

12 again were identical under these reaction conditions. 
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The results shown above demonstrate that the CFLP™ reaction can be used as 
a means to rapidly identify different strains (i.e., genotypes) of virus. The ability to 
rapidly identify the particular strain of virus or other pathogenic organism in a sample 
is of clinical importance. The above results show that the CFLP™ reaction can be 
5 used to provide a fast method of strain or species identification. 

EXAMPLE 22 

The Effects Of Alterations In Salt Conditions In 
Cleavage Reactions Using A Single-Stranded DNA Substrate 

In Example 13 it was shown that the Cleavase™ reaction is insensitive to large 
10 changes in reactions conditions when a single-stranded DNA is employed as the 

substrate. Example 13 showed that the cleavage reaction can be performed using a 
range of salt concentrations (0 to 50 mM KC1) in conjunction with single- stranded 
substrates. In this example, the effect of substituting other salts in place of KC1 was 
examined in cleavage reactions using single-stranded DNA substrates. 

15 a ) Effect Of Substituting NaCl For KC1 In Cleavage 

Reactions Using A Single-Stranded Template 

To determine the effect of substituting NaCl in place of KC1 upon the cleavage 
pattern created by 5' nuclease activity on a single- stranded DNA substrate, the 
following experiment was performed. A single template was incubated in the presence 
20 of a fixed amount of the Cleavase™ BN enzyme (50 ng) in a buffer containing 10 

mM MOPS, pH 8.2, ImM MnCl 2 and various amounts of NaCl. 

Approximately 100 fmoles of the 157 nucleotide fragment derived from the 
sense strand of exon 4 of the tyrosinase gene (SEQ ID NO 47; prepared as described 
in example 10b) were placed in a 500 ul thin wall microcentrifuge tubes (Perkin 
25 Elmer, Norwalk, CT) in 1 X CFLP™ buffer, pH 8.2 and 1.33 mM MnCl 2 (to yield a 
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final concentration of 1 mM MnCl 2 ) in a volume of 15 NaCl was added to yield a 
final concentration of 0, 10, 20, 30, 40, 50, 75 or 100 mM. The final reaction volume 
was 20 

A tube containing IX CFLP™ buffer, pH 8.2, 1 mM MnCl 2 and 100 fmoles 
substrate DNA was prepared and served as the no salt, no enzyme control (sterile 
distilled water was substituted for Cleavase™ BN enzyme and all reaction components 
were added prior to denaturation at 95°C). 

The tubes were heated to 95°C for 20 seconds and then rapidly cooled to 65°C. 
The cleavage reaction was started immediately by the addition of 5 p.1 of a diluted 
enzyme mixture comprising 1 jxl of Cleavase™ BN [50 ng/ul in 1 X dilution buffer 
(0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 8.0, 50 mM KC1, 10 ug/ml 
BSA)] in IX CFLP™ buffer, pH 8.2 without MnCl 2 . 

After 5 minutes at 65°C, reactions were stopped by the addition of 16 \i\ of 
stop buffer (95% formamide, 10 mM EDTA, 0.05% bromophenol blue, 0.05% xylene 
cyanol). Samples were heated to 72°C for 2 minutes and 7 ul of each reaction were 
resolved by electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), with 
7M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA, as 

described in Example 10a. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane, as described in example 10a. The DNA was transferred to the membrane 
and the membrane was dried, blocked in 1 X I-BLOCK (Tropix, Bedford, MA), 
conjugated with streptavidin-alkaline phosphatase (United States Biochemical), washed, 
reacted with CDP-STAR™ (Tropix, Bedford, MA) as described in Example 10a with 
the exception that 0.01 ml CDP-STAR™ was added per cm 2 of membrane. The 
membrane was exposed to X-ray film as described in Example 10a. The results are 
shown in Figure 52. 
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In Figure 52, the lane marked "M" contains molecular weight markers as 
described in example 10a. Lane 1 contains the no salt, no enzyme control and shows 
the mobility of the uncleaved template DNA. Lanes 2 through 9 contain reaction 
products incubated in the presence of Cleavase™ BN enzyme in a buffer containing 0, 
10, 20, 30, 40, 50 75 or 100 mM NaCl, respectively. 

The results shown in Figure 52 demonstrate that the substitution of NaCl in 
place of KC1 has little or no effect upon the cleavage pattern generated using the 157 
nucleotide tyrosinase DNA substrate (SEQ ID NO:47). Essentially the same 
dependence of the cleavage pattern on salt concentration was observed using this 
single-stranded DNA substrate when either KC1 (See example 13b, Figure35) or NaCl 
(Figure 52) was employed in the cleavage reaction. 



b) Effect Of Substituting (NH 4 ) 2 S0 4 For KC1 In Cleavage 
Reactions Using A Single-Stranded Template 

In an approach similar to that described in above in section a), the effect of 
15 substituting (NH 4 ) 2 S0 4 in place of KC1 upon the cleavage pattern created by 5' 

nuclease activity on a single-stranded DNA substrate was tested. Cleavage reactions 
were set up exactly as described in section a) with the exception that variable amounts 
of (NH 4 ) 2 S0 4 were used in place of the NaCl. 

Approximately 100 fmoles of the 157 nucleotide fragment derived from the 
20 sense strand of exon 4 of the tyrosinase gene (SEQ ID NO 47; prepared as described 

in example 10a) were placed in 500 ul thin wall microcentrifuge tubes (Perkin Elmer, 
Norwalk, CT) in 1 X CFLP™ buffer, pH 8.2 and 1.33 mM MnCl 2 (to yield a final 
concentration of 1 mM) in a volume of 15 ul. (NH 4 ) 2 S0 4 was added to yield a final 
concentration of 0, 10, 20, 30, 40, 50, 75 or 100 mM. The final reaction volume was 
25 20 ul. 

A tube containing IX CFLP™ buffer, pH 8.2, 1 mM MnCl 2 and 100 fmoles 
substrate DNA was prepared and served as the no salt, no enzyme control (sterile 
distilled water was substituted for Cleavase™ BN and all reaction components were 
added prior to denaturation at 95°C). 
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The tubes were heated to 95°C for 20 seconds and then rapidly cooled to 65°C. 
The cleavage reaction was started immediately by the addition of 5 ul of a diluted 
enzyme mixture comprising 1 ul of Cleavase™ BN [50 ng/ml in 1 X dilution buffer 
(0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 8.0, 50 mM KC1, 10 jig/ml 

5 BSA)] in IX CFLP™ buffer, pH 8.2 without MnCl 2 . 

After 5 minutes at 65°C, reactions were stopped by the addition of 16 ul of 
stop buffer (95% formamide, 10 mM EDTA, 0.05% bromophenol blue, 0.05% xylene 
cyanol). Samples were heated to 72°C for 2 minutes and 7 ul of each reaction were 
resolved by electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), with 

10 7M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA, as 

described in Example 10a. 

After electrophoresis, the DNA was transferred to a membrane and the detected 
as described in section a) above. The resulting autoradiograph is shown in Figure 53. 
In Figure 53, the lane marked "M" contains molecular weight markers as 
15 described in example 10a. Lane 1 contains the no enzyme control and shows the 

mobility of the uncleaved template DNA. Lanes 2 through 9 contain reaction products 
incubated in the presence of Cleavase™ BN enzyme in a buffer containing 0, 10, 20, 
30, 40, 50 75 or 100 mM (NH 4 ) 2 S0 4 , respectively. 

The results shown in Figure 53 demonstrate that the cleavage reaction is 
20 severely inhibited by the presence of (NH 4 ) 2 S0 4 . The reaction is completely inhibited 

by as little as 20 mM (NH 4 ) 2 S0 4 ; the extent of the cleavage reaction in 10 mM 
(NH 4 ) 2 S0 4 is comparable to that obtained in 50 mM KC1 or NaCl and is significantly 
reduced relative that obtained at 0 mM (NH 4 ) 2 S0 4 . The pattern of cleavage obtained 
at 10 mM (NH 4 ) 2 S0 4 , however, is identical to that observed when the 157 nucleotide 
25 template (SEQ ID NO:47) incubated in the absence of (NH 4 ) 2 S0 4 or in KC1 or NaCl. 

This indicates that the choice of salt included in the cleavase reaction has no effect on 
the nature of the sites recognized as substrates by the Cleavase™ BN enzyme (,\e., the 
inhibitory effect seen is due the effect of (NH 4 ) 2 S0 4 upon enzyme activity not upon 
the formation of the cleavage structures). 



202 



10 



15 



20 



c) Effect of Increasing KC1 Concentration on the 
Cleavage of Single-Stranded Substrates 

The effect of increasing the concentration of KC1 in cleavage reactions using a 
single-stranded DNA substrate was examined by performing the cleavage reaction in 
concentrations of KC1 which varied from 0 to 100 mM. The cleavage reactions were 
performed as described in section a) with the exception that KC1 was added to yield 
final concentrations of 0, 25, 50, 75 or 100 mM and 200 fmoles of the substrate were 
used in the reaction; additionally the substrate DNA was denatured by incubation at 
95 °C for 5 seconds. 

Following the cleavage reaction, the samples were electrophoresed, transferred 
to a membrane and detected as described in section a) above. The resulting 
autoradiograph is shown in Figure 54. 

In Figure 54, the lanes marked "M" contains molecular weight markers as 
described in Example 10a. Lane 1 is the no enzyme control; Lanes 2-7 contain 
reactions carried out in the presence of 0, 25, 50, 75, 100 or 100 mM KC1 (the 100 
mM sample was repeated twice), respectively. 

The results shown in Figure 54 demonstrate that the extent of cleavage in the 
cleavage reaction decreased as a function of increasing KC1 concentration (although 
residual cleavage was detectable at 100 mM KC1). Furthermore, the pattern of 
fragments generated by cleavage of single-stranded substrates with Cleavase™ BN is 
unaffected by the concentration of KC1 present in the reactions. 



d) Effect Of High KC1 Concentrations On Cleavage 
Reactions Using A Single-Stranded Substrate 

The ability of the Cleavase™ reaction to be carried out at relatively high 
25 concentrations of KC1 was tested by performing the cleavage reaction in the presence 

of variable concentrations of KC1 in excess of 100 mM. The reactions were performed 
using the 157 nucleotide fragment from exon 4 of the tyrosinase gene (SEQ ID 
NO:47) as described above in section c), with the exception that KC1 was added to 
yield a final concentration of 0, 100, 150, 200, 250 or 300 mM. 
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Following the cleavage reaction, the samples were electrophoreses transferred 
to a membrane and detected as described in section a) above. The results (data not 
shown) indicated that the cleavage reaction was severely inhibited by KCl 
concentrations in excess of 100 mM. Some residual cleavage did, however, persist at 
5 these elevated salt concentrations, up to and including 300 mM KCl. 

e) Effect Of KCl Concentration On The Stability Of The 
Cleavage Pattern During Extended Incubation Periods 

The results presented above demonstrate that the Cleavase™ reaction is 
inhibited by elevated concentrations (i.e., above 50 mM) of either KCl or NaCl. To 
10 determine whether this inhibition would effectively result in the stabilization of the 

cleavage pattern after extended reaction times (i.e., due to inhibition of enzyme 
activity), reactions were examined at varying extended time points at both 0 and 50 
mM KCl. 

Approximately 100 fmoles of the 157 nucleotide fragment derived from the 
15 sense strand of exon 4 of the tyrosinase gene (SEQ ID NO 47; prepared as described 

in example 10a) were placed in 200 ul thin wall microcentrifuge tubes (BioRad, 
Richmond, CA) in 1 X CFLP™ buffer, pH 8.2, 1.33 mM MnCl 2 (to yield a final 
concentration of 1 mM) and KCl to yield a final concentration of 0 or 50 mM KCl. 
The final reaction volume was 20 ul. 
20 Control reactions which lacked enzyme were set up in parallel for each time 

point examined; these no enzyme controls were prepared as described above with the 
exception that sterile distilled water was substituted for Cleavase™ BN and all reaction 
components were added prior to denaturation at 95°C. 

The tubes were heated to 95°C for 20 seconds and then rapidly cooled to 65°C. 
25 The cleavage reaction was started immediately by the addition of 5 ul of a diluted 

enzyme mixture comprising 1 ul of Cleavase™ BN [50 ng/ml in 1 X dilution buffer 
(0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 8.0, 50 mM KCl, 10 ng/ml 
BSA)] in IX CFLP™ buffer, pH 8.2 without MnCl 2 . Twenty microliters of Chill Out 
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1 4 ™ (MJ Research, Watertown, MA) were added to each tube after the addition of the 
enzyme. The reactions were allowed to proceed at 65°C for 5 min, 30 min, 1 hour, 2 

hours, 4 hours and 17 hours. 

At the desired time point, the reactions were stopped by the addition of 16 ul 
of stop buffer (95% formamide, 10 mM EDTA, 0.05% bromophenol blue, 0.05% 
xylene cyanol). Samples were heated to 72°C for 2 minutes and 7 ul of each reaction 
were resolved by electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), 
with 7M urea, in a buffer containing 45 mM Tris-Borate, P H 8.3, 1.4 mM EDTA, as 

described in Example 10a. 

After electrophoresis, the DNA was transferred to a membrane and the detected 
as described in section a) above. The resulting autoradiograph is shown in Figure 55. 

In Figure 55, the lane marked "M" contains molecular weight markers as 
described in example 10a. Lanes 1-10 contain products from reactions carried out in 
the absence of KC1; lanes 11-20 contain products from reactions carried out in the 
presence of 50 mM KC1. Lanes 1, 3, 5, 7, and 9 contain no enzyme controls 
incubated for 5 minutes, 30 minutes, 1 hour, 2 hours, 4 hours and 17 hours, 
respectively. Lanes 2, 4, 6, 8, and 10 contain the reaction products from reactions 
incubated at 65°C for 5 minutes, 30 minutes, 1 hour, 2 hours, 4 hours and 17 hours, 
respectively. Lanes 11, 13, 15, 17, and 19 contain no enzyme controls incubated in 50 
mM KC1 for 5 minutes, 30 minutes, 1 hour, 2 hours, 4 hours and 17 hours, 
respectively. Lanes 12, 14, 16, 18 and 20 contain reaction products from CFLP™ 
reactions incubated in 50 mM KC1 at 65°C for 5 minutes, 30 minutes, 1 hour, 2 hours, 
4 hours and 17 hours, respectively. 

The results indicated that cleavage was retarded in the presence of 50 mM KC1 
which resulted in a significant stabilization of the cleavage pattern (i.e., the cleavage 
pattern remained the same over time because the rate of cleavage was dramatically 
slowed and thus the larger cleavage fragments are not further cleaved to produce 
smaller fragments). Note that at the extended incubation times, the reactions carried 
out in the absence of KC1 were significantly overdigested; after 1 hour at 65°C, 
essentially no large fragments remain, and there is substantial accumulation of small 
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cleavage products. In contrast, the reactions carried out at 50 mM KC1 were 
essentially static between 30 minutes and 4 hours; overdigestion was only apparent at 
the longest time point and was not as extensive as that observed in the absence of KC1. 

EXAMPLE 23 

Comparison Of The Patterns Of Cleavage Generated By 
Cleavage-Of-Smgle-Stranded And Double- Stranded DNA Substrates 

In Cleavase™ BN-mediated primer-independent cleavage of double-stranded 
DNA substrates, the two strands of DNA are separated in a denaturafion step prior to 
the addition of enzyme. Therefore, the patterns generated by cleaving double-stranded 
templates should be identical to those generated by cleaving single-stranded template. 
This assumption was verified by the experiment described below. 

The single-stranded substrate comprising the 157 nucleotide fragment derived 
from the sense strand of exon 4 of the tyrosinase gene (SEQ ID NO:47) was prepared 
as described in example 10b with the following modification. After gel purification 
and precipitation in the presence of glycogen carrier, the PCR products were 
resuspended in TE (10mM Tris-Cl, pH 8.0, 1 mM EDTA) and then reprecipitated with 
2 M NH 4 OAc and 2.5 volumes of ethanol. The DNA was then resuspended in 400 ul 

of 10 mM Tris-HCl, pH 8.0, 0.1 mM EDTA. 

Approximately 50 or 100 fmoles of the single-stranded 157 nucleotide fragment 
(SEQ ID NO: 47) were placed in a 200 ul centrifuge tube (BioRad, Richmond, CA) in 
1 X CFLP™ buffer, pH 8.2 and 1.33 mM MnCl 2 (final concentration was 1 mM 
MnCl 2 ) in a volume of 15 jxl. The final reaction volume was 20 ul. A 20 ul no salt, 
no enzyme control was set up in parallel; this reaction contained sterile distilled water 
in place of the Cleavase™ BN enzyme and all reaction components were added prior 

to denaturation at 95°C. 

The reaction tubes were heated to 95°C for 5 seconds and then rapidly cooled 
to 65°C The cleavage reactions were started immediately by the addition of 5 ul of a 
diluted enzyme mixture comprising 1 ,1 of Cleavase™ BN [50 ng/ul in 1 X dilution 
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buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 8.0, 50 mM KC1, 10 
ug/ml BSA)] in IX CFLP™ buffer, pH 8.2 without MnCl 2 . After 5 minutes at 65°C, 
reactions were stopped by the addition of 16 »*1 of stop buffer (95% formamide, 10 
mM EDTA, 0.05% bromophenol blue, 0.05% xylene cyanol). 
5 A double stranded form of the 157 nucleotide substrate was cleaved with 

Cleavase™ BN in the same experiment. This double- stranded substrate (SEQ ID 
NO:40) was generated as described in Example 10b with the following modifications. 
After gel purification and precipitation in the presence of glycogen carrier, the PCR 
products were resuspended in TE (lOmM Tris-Cl, pH 8.0, 1 mM EDTA) and then 
10 reprecipitated with 2 M NH 4 OAc and 2.5 volumes of ethanol. The DNA was then 

resuspended in 400 ul of 10 mM Tris-HCl, pH 8.0, 0.1 mM EDTA. 

Approximately 33 or 66 fmoles of the double-stranded 157 bp fragment (SEQ 
ID NO:40) were placed in a 200 ul thin walled microcentrifuge tube (BioRad, 
Richmond, CA). Sterile distilled water was added to a volume of 15 ul. 

The reaction tubes were heated to 95°C for 5 seconds and then rapidly cooled 
to 65°C. The cleavage reactions were started immediately by the addition of 5 ul of a 
diluted enzyme mixture comprising 4X CFLP™ buffer, pH 8.2, 0.8 mM MnCl 2 (to 
yield a final concentration of IX CFLP™ buffer and 0.2 mM MnCl 2 ) and 0.5 ul of 
Cleavase™ BN [50 ng/ul in 1 X dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 
20 mM Tris-HCl, P H 8.0, 50 mM KC1, 10 ug/ml BSA)]. A 20 ul no salt, no enzyme 

double-stranded substrate control was set up in parallel; this reaction contained sterile 
distilled water in place of the Cleavase™ BN enzyme. 

After 5 minutes at 65°C, the reactions were stopped by the addition of 16 ul of 
stop buffer (95% formamide, 10 mM EDTA, 0.05% bromophenol blue, 0.05% xylene 
25 cyanol). The samples were then heated to 72°C for 2 minutes and the reaction 

products were resolved by electrophoresis through a 10% polyacrylamide gel (19:1 
cross-link), with 7M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM 
EDTA, as described in Example 10a. 



15 
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After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane, as described in example 10a. The DNA was transferred to the membrane 
and the membrane was dried, blocked in 1 X I-BLOCK (Tropix, Bedford, MA), 
conjugated with streptavidin-alkaline phosphatase (United States Biochemical), washed, 
5 reacted with CDP-STAR (Tropix, Bedford, MA), and exposed to X-ray film as 

described in Example 22a. The resulting autoradiograph is shown in Figure 56. 

In Figure 56, lanes 1-3 contain reaction products derived from reactions 
containing the single- stranded substrate; lanes 4-7 contain reaction products derived 
from reactions containing the double-stranded substrate. Lanes land 3 contain 7.0 ul 
10 of the reaction products derived from the cleavage reactions which contained either 50 

or 100 fmoles of the single- stranded substrate, respectively. Lane 2 contains 7.0 ul of 
the uncut single-stranded substrate control reaction. Lanes 4 and 6 contain 7.0 ul of 
the uncut double-stranded control reactions which contained either 33 or 66 fmoles of 
the substrate, respectively. Lanes 5 and 7 contain 7.0 ul of the reaction products 
15 derived from cleavage reactions which contained either 33 or 66 fmoles of the double- 

stranded substrate, respectively. Note that the uncut double-stranded control shows a 
doublet underneath the prominent band containing the 157 bp substrate; it is believed 
that this doublet represents alternative structures which migrate with an altered 
mobility rather than degradation products. This doublet does not appear in 
20 experiments performed using double-stranded DNA purified from a denaturing gel 

(See Example 24) 

Comparison of the cleavage patterns generated by cleavage of either the single- 
stranded or double-stranded substrate shows that identical patterns are generated. 
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EXAMPLE 24 

The Cleavase™ Reaction Using A Double Stranded DNA 
Template Is Sensitive to Large Changes In Reaction Conditions 

The results presented in Example 13 demonstrated that the Cleavase™ reaction 
5 is relatively insensitive to significant changes in numerous reaction conditions 

including, the concentration of MnCl 2 and KC1, temperature, the incubation period, the 
amount of Cleavase™ BN enzyme added and DNA preparation. The results shown in 
Example 13 demonstrated that when the Cleavase™ reaction is performed using a 
single-stranded substrate, the reaction is remarkably robust to large changes in 
10 conditions. The experiments shown below show that the cleavage of double- stranded 

substrates is restricted to a somewhat narrower range of reaction conditions. 

a) Generation Of The Double-Stranded 157 bp Fragment 
Of Exon 4 Of The Tyrosinase Gene 

The following experiments examine the effect of changes in reaction conditions 
15 when double-stranded DNA templates are used in the Cleavase™ reaction. The 

double-stranded substrate utilized was the!57 bp fragment of the wild type tyrosinase 
gene (SEQ ID NO:40). This 157 bp fragment was generated using symmetric PCR as 
descnbed in Example 10b. Briefly, approximately 75 fmoles of double-stranded 
substrate DNA were incubated with 50 pmoles of the primer 5' biotin- 
20 GCCTT ATTTT ACTTT AAAAAT-3 ' (SEQ ID NO: 45), 50 pmoles of the primer 5' 

fluorescein-TAAAGTTTTGTGTTATCTCA-3' (SEQ ID NO:46), 50 mM of each 
dNTP, 20 mM Tris-Cl, pH 8.3, 1.5 mM MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 
and 0.05% NONIDET P-40 (NP40). Tubes containing 95 ul of the above mixture 
were heated to 95°C for 5 seconds and cooled to 70°C. Five microliters of enzyme 
25 mix containing 1.25 units of Tag DNA polymerase in 20 mM Tris-Cl, pH 8.3, 1.5 mM 

MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET P-40 were then 
added.' Each tube was overlaid with 50 ul of Chill Out 14™ (MJ Research, 
Watertown, MA). 
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The tubes were heated to 95°C for 45 seconds, cooled to 50°C for 45 seconds, 
heated to 72°C for 75 seconds for 30 repetitions with a 5 minute incubation at 72°C 
after the last repetition. The reactions were then ethanol precipitated to reduce the 
volume to be gel purified. NaCl was added to a final concentration of 400 mM and 
5 glycogen (in distilled water) was added to a final concentration of 200 ug/ml. Two 

and one-half volumes of 100% ethanol were added to each tube, and the tubes were 
chilled to -70°C for two and one-half hours. The DNA was pelleted and resuspended 
in one-fifth the original volume of sterile distilled water. 

The PCR products were gel purified as follows. An equal volume of stop 
10 buffer (95% formamide, 0.05% bromophenol blue, 0.05% xylene cyanol) was added to 

each tube and the tubes were heated to 72°C for 2 minutes. The products were 
resolved by electrophoresis through a 6 % denaturing polyacrylamide gel (19:1 cross- 
link) and 7 M urea in a buffer containing 45 mM Tris-Borate, pH 8.3 and 1.4 mM 
EDTA. The DNA was visualized by ethidium bromide staining and the 157 bp 
15 fragment was excised from the gel. The DNA was eluted from the gel slice by passive 

diffusion as described in Example 10a with the exception that diffusion was allowed to 
occur over two days at room temperature. The DNA was then precipitated with 
ethanol in the presence of 200 mM NaCl and no added carrier molecules. The DNA 
was pelleted and resuspended in 150 ul TE (10 mM Tns-Cl, P H 8.0, 0.1 mM EDTA). 

20 b) Effect Of KC1 Concentration On The Double- 

Stranded Cleavage Reaction 

To determine the effect of salt concentration upon the cleavage reaction when a 
double-stranded substrate was utilized, a single substrate was incubated in the presence 
of a fixed amount of the enzyme Cleavase™ BN (25 ng) in a buffer containing 10 
25 mM MOPS, pH 7.5, 0.2 mM MnCl 2 and varying concentrations of KC1 from 0 to 100 

mM. 

Approximately 100 fmoles of the 157 bp fragment derived from the exon 4 of 
the tyrosinase gene (SEQ ID NO:40; prepared as described above in section a) were 
placed in 200 ul thin wall microcentrifuge tubes (BioRad, Richmond, CA) in sterile 
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distilled water in a volume of 6.25 ul (the final reaction volume was 10 p.1). The 
tubes were heated to 95°C for 15 seconds and then rapidly cooled to 45°C. The 
cleavage reactions were started by the addition of 3.75 jxl of an enzyme mix containing 
2 7 X CFLP™ buffer, pH 7.5 (to yield a final concentration of 1 X), 0.53 mM MnCl 2 
(to yield a final concentration of 0.2 mM), 0.5 p.1 Cleavase™ BN [50 ng/ul in 1 X 
dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, P H 8.0, 50 mM KC1, 
10 us/ml BSA)], and KC1 to yield a final concentration of 0, 2.5, 5, 10, 15, 20, 25, 
30, 50 or 100 mM. The final reaction volume was 10 jxl. The enzyme solution was 
brought to room temperature before addition to the cleavage reaction. No enzyme 
(i.c, uncut) controls were set up in parallel at either 0 or 100 mM KC1, with the 
difference that sterile distilled water was substituted for the Cleavase™ BN. 

After 5 minutes at 45°C, the reactions were stopped by the addition of 8 p.1 of 
stop buffer (95% formamide, 10 mM EDTA, 0.05% bromophenol blue, 0.05% xylene 
cyanol). Samples were heated to 72°C for 2 minutes and 4 p.1 of each reaction were 
resolved by electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), with 7 
M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane, as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in IX I-BLOCK Blocking Buffer, washed and 
exposed to X-ray film as described in Example 22a, except that the distilled water 
washes were omitted. The resulting autoradiograph is shown in Figure 57. 

In Figure 57, the lane marked "M" contains molecular weight markers. Lane 1 
contains the uncut control in 0 mM KC1 and shows the mobility of the uncleaved 
template DNA. Lanes 2 through 11 contain reaction products generated by incubation 
of the substrate in the presence of Cleavase™ BN enzyme in a buffer containing 0, 
2.5, 5, 10, 15, 20, 25, 30, 50, or 100 mM KC1, respectively. Lane 12 contains the 
uncut control incubated in a buffer containing 100 mM KC1. 

The results shown in Figure 57 demonstrate that the Cleavase™ reaction 
carried out on double-stranded DNA template was sensitive to variations in salt 
concentration. Essentially no cleavage was detected in reactions containing greater 
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than 30 inM KC1. The same cleavage pattern was obtained when the 157 bp 
tyrosinase DNA substrate (SEQ ID NO:40) was incubated with the Cleavase™ BN 
enzyme regardless of whether the concentration of KC1 was varied from 0 to 30 mM. 

c) Effect Of NaCl On The Double-Stranded Cleavage 
Reaction 

The effect of substituting NaCl in place of KC1 upon the cleavage pattern 
created by 5' nuclease activity on a double-stranded DNA substrate was examined. 
Approximately 100 fmoles of the 157 bp fragment derived from exon 4 of the 
tyrosinase gene (SEQ ID NO 40; prepared as described in Example 24a) were placed 
in 200 ul thin wall microcentrifuge tubes (BioRad, Richmond, CA) in sterile distilled 
water in a volume of 6.25 ul and were heated to 95°C for 15 seconds. The tubes were 
cooled to 45°C. The cleavage reaction was started by the addition of 3.75 ul of an 
enzyme mix containing 2.7 X CFLP™ buffer, pH 7.5 (to yield a final concentration of 

1 X) 0.53 mM MnCl 2 (to yield a final concentration of 0.2 mM), 0.5 ul Cleavase™ 
BN ['so ng/ul in 1 X dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, 
P H 8.0, 50 mM KC1, 10 jig/ml BSA)], and NaCl to yield a final concentration of 0, 

2 5 5 10, 15, 20, 25, 30, 50 or 100 mM. The final reaction volume was 10 ul. No 
enz'yme (/.e., uncut) controls were set up in parallel at either 0 or 100 mM NaCl, with 
the difference that sterile distilled water was substituted for the Cleavase™ BN. 

The reactions were incubated at 45°C for 5 minutes and were stopped by the 
addition of 8 ul of stop buffer (95% formamide, 10 mM EDTA, 0.05% bromophenol 
blue, 0.05% xylene cyanol). Samples were heated to 72°C for 2 minutes and 4 ul of 
each reaction were resolved by electrophoresis through a 10% polyacrylamide gel 
(19:1 cross-link), with 7 M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 
1.4 mM EDTA. 
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After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane, as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in IX I-BLOCK Blocking Buffer, washed and 
exposed to X-ray film as described in Example 22a with the exception that the distilled 
5 water washes were omitted. The resulting autoradio graph is shown in Figure 58. 

In Figure 58 , the lane marked "M" contains molecular weight markers. Lane 1 
contains the no enzyme control incubated in a buffer containing 0 mM NaCl and 
shows the mobility of the uncleaved template DNA. Lanes 2 through 1 1 contain 
reaction products generated by cleavage of the 157 bp substrate (SEQ ID NO:40) with 
10 the Cleavase™ BN enzyme in a buffer containing 0, 2.5, 5, 10, 15, 20, 25, 30, 50, or 

100 mM NaCl, respectively. Lane 12 contains the no enzyme control incubated in a 

buffer containing 100 mM NaCl. 

The results shown in Figure 58 demonstrate that the Cleavase™ reaction 
carried out on a double-stranded DNA template was sensitive to variations in NaCl 
15 concentration. Essentially no cleavage was detected above 20 mM NaCl. The same 

cleavage pattern was obtained when the 157 bp tyrosinase DNA template (SEQ ID 
NO:40) was incubated with the Cleavase™ BN enzyme regardless of whether the NaCl 
concentration was varied from 0 to 20 mM. 

d) Effect Of Substituting (NH 4 ) 2 S0 4 For KC1 In Cleavage 
20 Of Double-Stranded Template 

In an approach similar to that described in Example 22b, the ability of 
(NH 4 ) 2 S0 4 to substitute for KC1 in the cleavage reaction when double-stranded 
substrates were utilized was tested. Cleavage reactions were set up exactly as 
described in Examples 24b and c with the exception that variable amounts of 
25 (NH 4 ) 2 S0 4 were substituted for the KC1 or NaCl. 
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Approximately 100 fmoles of the 157 bp fragment derived exon 4 of the 
tyrosinase gene (SEQ ID NO 40; prepared as described above in section a) were 
placed in 200 ul thin wall microcentrifuge tubes (BioRad, Richmond, CA) in sterile 
distilled water in a volume of 6.25 ul and were heated to 95°C for 15 seconds. The 

tubes were cooled to 45°C 

Cleavage reactions were started by the addition of 3.75 ul of an enzyme mix 
containing 2.7 X CFLP™ buffer, pH 7.5 (to yield a final concentration of 1 X), 0.53 
mM MnCl 2 (to yield a final concentration of 0.2 mM MnCl 2 ), 0.5 ul Cleavase™ BN 
[50 ng/ul in 1 X dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 
8.0, 50 mM KC1, 10 jig/ml BSA)], and (NH 4 ) 2 S0 4 to yield a final concentration of 0, 
2.5, 5, 10, 15, 20, 25, 30, 50 or 100 mM. The final reaction volume was 10 ul. No 
enzyme (i.e., uncut) controls were set up in parallel at either 0 or 100 mM (NH 4 ) 2 S0 4 , 
with the difference that sterile distilled water was substituted for the Cleavase™ BN. 

The reactions were incubated at 45°C for 5 minutes and were stopped by the 
addition of 8 ul of stop buffer (95% formamide, 10 mM EDTA, 0.05% bromophenol 
blue, 0.05% xylene cyanol). Samples were heated to 72°C for 2 minutes and 4 ul of 
each reaction were resolved by electrophoresis through a 10% polyacrylamide gel 
(19:1 cross-link), with 7 M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 
1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane, as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in IX I-BLOCK Blocking Buffer, washed and 
exposed to X-ray film as described in Example 22a, except that the distilled water 
washes were omitted. The resulting autoradiograph is shown in Figure 59. 

In Figure 59, the lane marked "M" contains molecular weight markers. Lane 1 
contains the no enzyme control incubated in a buffer containing 0 mM (NH 4 ) 2 S0 4 and 
shows the migration of the uncleaved substrate DNA. Lanes 2 through 11 contain 
reaction products generated by incubation of the substrate in the presence of 
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Cleavase™ BN enzyme in a buffer containing 0, 2.5, 5, 10, 15, 20, 25, 30, 50, or 100 
mM (NH 4 ) 2 S0 4 , respectively. Lane 12 contains the no enzyme control incubated in a 
buffer containing 100 mM (NH 4 ) 2 S0 4 . 

The results shown in Figure 59 demonstrate that the Cleavase™ reaction was 
severely inhibited by the presence of (NH 4 ) 2 S0 4 . The reaction was completely 
inhibited by as little as 15 mM (NH 4 ) 2 S0 4 ; the extent of the cleavage reaction in 5 mM 
(NH 4 ) 2 S0 4 was comparable to that obtained in 20 mM KC1 and was significantly 
reduced relative to that obtained in 0 mM (NH 4 ) 2 S0 4 . The pattern of cleavage 
obtained using 5 mM (NH 4 ) 2 S0 4 , however, was identical to that observed when the 
157 bp substrate was incubated in the absence of (NH 4 ) 2 S0 4 or in KC1 or NaCl, 
indicating that the choice of salt included in the Cleavase™ reaction has no effect on 
the nature of the sites recognized by the enzyme. 



e) Time Course Of The Double-Stranded Cleavage 
Reaction 

15 To determine how quickly the double-stranded cleavage reaction is completed, 

a single substrate was incubated in the presence of a fixed amount of Cleavase™ BN 
enzyme for various lengths of time. Approximately 100 fmoles of the double-stranded 
157 bp fragment of exon 4 of the tyrosinase gene (SEQ ID NO 40; prepared as 
described above in Example 24a) were placed in sterile distilled water in 200 p.1 thin 

20 walled centrifuge tubes (BioRad, Richmond, CA) in a volume of 6.25 ul. The tubes 

were heated to 95°C for 15 seconds, as described in section b), and cooled to 45°C. 

Cleavage reactions were started by the addition of 3.75 ul of an enzyme mix 
containing 2.7 X CFLP™ buffer, pH 7.5 (to yield a final concentration of 1 X), 0.53 
mM MnCl 2 (to yield a final concentration of 0.2 mM MnCl 2 ), 0.5 ul Cleavase™ BN 

25 [50 ng/ul in 1 X dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 

8.0, 50 mM KC1, 10 |ag/ml BSA)]. The final reaction volume was 10 ul. No enzyme 
(i.e., uncut) controls were set up in parallel and stopped after either 5 minutes or 120 
minutes, with the difference that sterile distilled water was substituted for the 
Cleavase™ BN enzyme. 
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The cleavage reactions were stopped by the addition of 8 ul of stop buffer 
(95o/o formamide, 10 mM EDTA, 0.05% bromophenol blue, 0.05% xylene cyanol) at 
the following tunes: 5 seconds, 1, 2, 5, 10, 15, 20, 30, 60 or 120 minutes. Samples 
were heated to 72°C for 2 minutes and 4 p.1 of each reaction were resolved by 
5 electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), with 7 M urea, in 

a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane, as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in IX I-BLOCK Blocking Buffer, washed and 
10 exposed to X-ray film as described in Example 22a with the exception that the distilled 

water washes were omitted. The resulting autoradio graph is shown in Figure 60. 

In Figure 60, lane 1 contains the no enzyme control after a 5 minute incubation 
at 45°C and shows the mobility of the uncleaved template DNA. Lanes 2-10 contain 
cleavage fragments derived from reactions incubated in the presence of Cleavase™ BN 
for 5 sec, 1, 2, 5, 10, 15, 20, 30, 60 (i hr), or 120 minutes (2 hr), respectively. Lane 
11 contains the no enzyme control after a 120 minute incubation at 45°C. 

Figure 60 shows that the cleavage of a double-stranded DNA template mediated 
by the Cleavase™ BN enzyme was rapid. A full cleavage pattern was apparent and 
essentially complete within one minute. Unlike the example of cleavage of a single- 
20 stranded DNA template (Example 13c), very little cleavage is detectable after 5 

seconds. This reaction contained one-tenth the amount of enzyme used in the reaction 
described in Example 13c. In addition, whereas incubation of single-stranded cleavage 
reactions for extended periods generated a pattern of increasingly truncated fragments 
(Example 22e), extended incubation of the double-stranded cleavage reaction resulted 
25 in a complete loss of signal (Figure 60, lane 10); this result is probably due to nibbling 

by the enzyme of the 5' biotin moiety from the reannealed strands. It is important to 
note that these results show that the same pattern of cleavage was produced for 
cleavage of double-stranded DNA, as for single-stranded, whether the reaction is run 



15 
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for 1 or 30 minutes. That is, the full representation of the cleavage products (i.e., 
bands) is seen over a 30-fold difference in time of incubation; thus the double- stranded 
CFLP™ reaction need not be strictly controlled in terms of incubation time. 

The results shown in Figure 61 contain short time courses of cleavage reactions 
5 performed at a variety of enzyme concentrations. Approximately 100 fmoles of the 

double-stranded 157 bp fragment of exon 4 of the tyrosinase gene (SEQ ID NO:40) 
were placed in sterile distilled water in 200 ul thin walled centrifuge tubes (BioRad, 
Richmond, CA) in a volume of 6.25 ul. The tubes were heated to 95°C for 15 
seconds, as described in Example 24b, and cooled to 45°C. Cleavage reactions were 
10 started by the addition of 3.75 ul of an enzyme mix containing 2.7X CFLP™ buffer, 

pH 7.5 (to yield a final concentration of 1 X), 0.53 mM MnCl 2 (to yield a final 
concentration of 0.2 mM MnCl 2 ), 0.5 ul Cleavase™ BN [at either 50, 100, 200, or 
500 ng/ul in 1 X dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 
8.0, 50 mM KC1, 10 ug/ml BSA) to yield a final amount of enzyme of 25, 50, 100, or 
15 250 ng]. The final reaction volume was 10 ul. A no enzyme control was set up in 

parallel, with the difference that sterile distilled water was substituted for the 
Cleavase™ BN enzyme, and stopped after 1 minute at 45°C. 

The cleavage reactions were stopped by the addition of 8 ul of stop buffer 
(95% formamide, 10 mM EDTA, 0.05% bromophenol blue, 0.05% xylene cyanol) 
20 after either 5 seconds or 1 minute. Samples were heated to 72°C for 2 minutes and 4 

Hi of each reaction were resolved by electrophoresis through a 10% poly aery lamide gel 
(19:1 cross-link), with 7 M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 
1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
25 membrane, as described in Example 10b. The DNA was transferred to the membrane 

and the membrane was dried, washed in IX I-BLOCK Blocking Buffer, washed and 
exposed to X-ray film as described in Example 22a, except that the distilled water 
washes were omitted. The resulting autoradio graph is shown in Figure 61. 
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In Figure 61, lane "M" contains molecular weight markers as described in 
Example 10a. Lane 1 contains the no enzyme control. Lanes 2 and 3 each contain 
reaction products generated by incubation of the substrate in the presence of 25 ng of 
Cleavase™ BN; the reaction in lane 2 was stopped after 5 seconds, that in lane 3, after 

5 1 minute. Lanes 4 and 5 contain reaction products generated by cleavage of the 

substrate in the presence of 50 ng of Cleavase™ BN; the reaction in lane 4 was 
stopped after 5 seconds, that in lane 5, after 1 minute. Lanes 6 and 7 contain reaction 
products generated by cleavage of the substrate in the presence of 100 ng of 
Cleavase™ BN enzyme; the reaction in lane 6 was stopped after 5 seconds, that in 

10 lane 7, after 1 minute. The reactions shown in lanes 8 and 9 each contain 250 ng of 

Cleavase™ BN; that in lane 8 was stopped after 5 seconds, that in lane 9, after 1 
minute. 

The results presented in Figure 61 indicate that the rate of cleavage of double- 
stranded DNA increased with increasing enzyme concentration. Note that as the 

1 5 concentration of enzyme was increased, there was a corresponding reduction in the 

amount of uncut DNA that remained after 1 minute. As was demonstrated below, in 
Figure 63, the concentration of enzyme included in the cleavage reaction had no effect 
on the cleavage pattern generated. Comparison of the 250 ng reaction (shown in 
Figure 61, lanes 8 and 9) to the short time point digestion described in Example 13c, 

20 indicates that the amount of enzyme rather than the double- stranded or single-stranded 

nature of the substrate controls the extent of cleavage in the very early time points. 

f) Temperature Titration Of The Double-Stranded 
Cleavage Reaction 

To determine the effect of temperature variation on the cleavage pattern, the 
25 157 bp fragment of exon 4 of the tyrosinase gene (SEQ ID NO:40) was incubated in 

the presence of a fixed amount of Cleavase™ BN enzyme for 5 minutes at various 
temperatures. Approximately 100 fmoles of substrate DNA (prepared as described in 
Example 24a) were placed in sterile distilled water in 200 ul thin walled centrifuge 
tubes (BioRad, Richmond, CA) in a volume of 6.25 ul. The tubes were heated to 
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95°C for 15 seconds and cooled to either 37, 40, 45, 50, 55, 60, 65, 60, 75, or 80°C. 
Cleavage reactions were started by the addition of 3.75 ul of an enzyme mix 
containing 2.7 X CFLP™ buffer, pH 7.5 (to yield a final concentration of 1 X), 0.53 
mM MnCl 2 (to yield a final concentration of 0.2 mM MnCl 2 ), 0.5 ul Cleavase™ BN 
[50 ng/ul in 1 X dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 
8.0, 50 mM KC1, 10 ug/ml BSA)]. The enzyme mix was kept on ice throughout the 
duration of the experiment, but individual aliquots of the enzyme mix were allowed to 
come to room temperature before being added to the reactions. A second reaction was 
run at 37°C at the end of the experiment to control for any loss of enzyme activity that 
may have occurred during the course of the experiment. No enzyme controls were set 
up in parallel and incubated at either 37°C or 80°C, with the difference that sterile 
distilled water was substituted for the Cleavase™ BN. The reactions were stopped by 
the addition of 8 ul of stop buffer (95% formamide, 10 mM EDTA, 0.05% 
bromophenol blue, 0.05% xylene cyanol). 

Samples were heated to 72°C for 2 minutes and 5 ul of each reaction were 
resolved by electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), with 7 
M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in IX I-BLOCK Blocking Buffer, washed and 
exposed to X-ray film as described in Example 22a, except that the distilled water 
washes were omitted. The resulting autoradiograph is shown in Figure 62. 

In Figure 62, the lane marked "M" contains molecular weight markers, prepared 
as described in Example 10a. Lane 1 contains the no enzyme control after a 5 minute 
incubation at 37°C. Lanes 2 and 3 contain reactions incubated at 37°C, run at the 
beginning and end of the experiment, respectively. Lanes 4-13 contain reactions 
incubated at 40, 45, 50, 55, 60, 65, 70, 75, or 80°C [there are two 80°C samples; the 
first was not covered with Chill Out 14™ (MJ Research, Watertown, MA), the second 
was overlaid with 20 ul Chill Out 14™ after the addition of the enzyme mix], 
respectively. Lane 14 contains a no enzyme control incubated at 80°C for 5 minutes. 
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Figure 62 shows that cleavage of double-stranded DNA substrates proceeded 
most effectively at lower temperatures. The distribution of signal and pattern of 
cleavage changed smoothly in response to the temperature of incubation over the range 
of 37°C to 60°C. Some cleavage products were evident only upon incubation at 
5 higher temperatures, whereas others were far more predominant at lower temperatures. 

Presumably, certain structures that are substrates for the Cleavase™ BN enzyme at one 
end of the temperature range are not favored at the other. As expected, the production 
of cleavage fragments became progressively less abundant in the high end of the 
temperature range as the cleavage structures were melted out. Above 70°C, the 
10 cleavage products were restricted to small fragments, presumably due to extensive 

denaturation of the substrate. When longer DNAs (350 to 1000 nucleotides) are used, 
it has been found that useful patterns of cleavage are generated up to 75°C. 

These results show that the cleavage reaction can be performed over a fairly 
wide range of temperatures using a double- stranded DNA substrate. As in the case of 
15 the single-stranded cleavage reaction, the ability to cleave double-stranded DNA over a 

range of temperatures is important. Strong secondary structures that may dominate the 
cleavage pattern are not likely to be destabilized by single-base changes and may 
therefore interfere with mutation detection. Elevated temperatures can then be used to 
bring these persistent structures to the brink of instability, so that the effects of small 
20 changes in sequence are maximized and revealed as alterations in the cleavage pattern. 

This also demonstrates that within the useful temperature range, small changes in the 
reaction temperature due to heating block drift or similar device variations will not 
cause radical changes in the cleavage pattern. 

g ) Titration Of The Cleavase™ BN Enzyme In Double- 
25 Stranded Cleavage Reactions 

The effect of varying the concentration of the Cleavase™ BN enzyme in the 
double-stranded cleavage reaction was examined. Approximately 100 fmoles of the 
157 bp fragment of exon 4 of the tyrosinase gene (SEQ ID NO:40; prepared as 
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described in Example 24a) were placed in sterile distilled water in 200 ul thin walled 

centnfuge tubes (BioRad, Richmond, CA) in a total volume of 6.25 ul. These tubes 

were heated to 95°C for 15 seconds and then rapidly cooled to 45'C. 

Cleavage reactions were started immediately by the addition of 3.75 ul of a 

diluted enzyme mix containing 2.7 X CFLP™ buffer, pH 7.5 (to yield a final 

concentration of 1 X), 0.53 mM MnCl 2 (to yield a final concentration of 0.2 mM 

MnCl ; ) 0.5 ul Cleavase™ BN [2, 10, 20, 50, 100, 200, 500 ng/ul in 1 X dilution 

buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 8.0, 50 mM KCl, 10 

ug/ml BSA) such that 1, 5, 10, 25, 50, 100 or 250 ng of enzyme was added to the 

j reactions]. No enzyme controls were set up in parallel, with the difference that IX 

dilution buffer was substituted for the Cleavase™ BN. After 5 minutes at 45°C, the 

reactions were stopped by the addition of 8 ,1 of stop buffer (95% formanude, 10 mM 

EDTA 0.05% bromophenol blue, 0.05% xylene cyanol). The samples were heated to 

72°C for 2 minutes and 4 ul of each reaction were resolved by electrophoresis through 

1 • j i no-1 r> mC5 linkl with 7 M urea, in a buffer containing 45 
5 a 10% polyacrylamide gel (19.1 cross-linig, wnn / m uio , 

mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in IX I-BLOCK Blocking Buffer, washed and 
20 exposed to X-ray film as described in Example 22a, except that the distilled water 

washes were omitted. The resulting autoradiograph is shown in Figure 63. 

The lane marked "M" in Figure 63 contains molecular weight markers. Lane 1 
contains the no enzyme control and shows the migration of the uncut substrate. Lanes 
2-8 contain cleavage products derived from reactions containing 1, 5, 10, 25, 50, 100 
25 or 250 ng of the Cleavase™ BN enzyme, respectively. 

These results show that the same cleavage pattern was obtained using the 157 
bp tyrosinase DNA substrate (SEQ ID NO:40) regardless of whether the amount of 
enzyme used in the reaction varied over a 50-fold range. Thus, the double-stranded 
cleavage reaction is ideally suited for practice in clinical laboratories where reaction 
30 conditions are not as controlled as in research laboratories. Note, however, that there 
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is a distinct optimum for cleavage at intermediate enzyme concentrations for a double- 
stranded template, in marked contrast to what was observed on single- stranded 
substrates (Example 13c). The progressive loss of signal in the double- stranded 
reactions at mcreasmg concentrations of Cleavase™ BN is likely due to the nibbling of 
5 the 5' biotin label off the end of the reannealed double- stranded template. 

EXAMPLE 25 

Determination Of The pH Optimum For Single- 
Stranded And Double-Stranded Cleavage Reactions 

In order to establish optimal pH conditions for the two types of primer- 
10 independent cleavage reactions (i,, single- stranded and double-stranded cleavage 

reactions), the Cleavase™ reaction buffer was prepared at a range of different pHs. 

a) Establishment Of A pH Optimum For The Single- 
Stranded Cleavage Reaction 

The effect of varying the pH of the Cleavase™ reaction (i.e., CFLP™) buffer 
15 upon the cleavage of single-stranded substrates was examined. Several 10 X buffer 

solutions were made with 0.5 M MOPS at pH 6.3, 7.2, 7.5, 7.8, 8.0 and 8.2 by 
titrating a 1 M solution of MOPS at pH 6.3 with 6 N NaOH. The volume was then 
adjusted to yield a 0.5 M solution at each pH. 

Approximately 100 fmoles of a single-stranded substrate prepared from the 
20 sense strand of exon 4 of the tyrosinase gene (SEQ ID NO:47; prepared as described 

in Example 10a), were placed in 200 ul thin walled centrifuge tubes (BioRad, 
Richmond, CA) in 15 pi of 1 X CFLP™ buffer, at varying pH, and 1.33 mM MnCl 2 
(to yield a final concentration of 1 mM). The final reaction volume was 20 pi. The 
reaction mixes were heated to 95°C for 5 seconds and rapidly cooled to 65°C. The 
25 reactions were started by the addition of 5 pi of diluted enzyme mix containing 1 pi of 

Cleavase™ BN [50 ng/ul in 1 X dilution buffer (0.5% NP40, 0.5% TWEEN 20, 20 
mM Tns-HCl, pH 8.0, 50 mM KC1, 10 pg/ml BSA)] in 1 X CFLP™ buffer (without 
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MnCl 2 ) again at the appropriate pH. A 20 jxl no salt, no enzyme control was set up m 
parallel and incubated at 65°C for each of the indicated P Hs, with the difference that 
sterile distilled water was substituted for Cleavase™ BN and all reaction components 
were added prior to denaturation. Reactions were stopped by the addition of 16 ^1 of 
stop buffer (95% formamide, 10 mM EDTA, 0.05% bromophenol blue, 0.05% xylene 

cyanol) after 5 minutes. 

Samples were heated to 72°C for 2 minutes and 7 ul of each reaction were 
resolved by electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), with 
7M urea, in a buffer contaimng 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA, as 

described in Example 10a. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane, as described in Example 10a. The DNA was transferred to the membrane 
and the membrane was dried, blocked in 1 X I-BLOCK (Tropix, Bedford, MA), 
conjugated with streptavidin-alkaline phosphatase (United States Biochemical), washed, 
reacted with CDP-STAR™ (Tropix, Bedford, MA), and exposed to X-ray film as 
described in Example 22a, except that the distilled water washes were omitted. The 

results are presented in Figure 64. 

In Figure 64, panels A and B contain reactions which used single- stranded 
DNA substrates. In panel A, 5 pairs of reactions are presented. In each case, the first 
lane of the pair is the no enzyme control and the second is the single-stranded 
cleavage reaction. Lanes 1 and 2 depict reaction products obtained using a reaction 
buffer at pH 6.3; lanes 3 and 4, at pH 7.2; lanes 5 and 6, P H 7.8; lanes 7 and 8, pH 
8 0- lanes 9 and 10, at P H 8.2. Panel B contains the results of a separate experiment 
comparing cleavage reactions performed using a reaction buffer at pH 7.5 (lanes 1 and 
2, uncut and cut, respectively) and at P H 7.8 (lanes 3 and 4, uncut and cut, 
respectively). 
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The results shown in Figure 64, panels A and B, indicate that the cleavage of 
the single-stranded DNA template was sensitive to relatively small changes in pH. 
There was a pH optimum for the reaction between P H 7.5 and 8.0. Because the pK, 
of MOPS is 7.2, the pH closest to that value which supported cleavage, pH 7.5, was 
determined to be optimal for the single-stranded cleavage reaction. 

b) Establishment Of A pH Optimum For The Double- 
Stranded Cleavage Reaction 

The effect of varying the pH of the Cleavase™ reaction (i.e., CFLP™) buffer 
upon the cleavage of double-stranded substrates was examined. Several 10 X buffer 
solutions were made with 0.5 M MOPS at pH 7.2, 7.5, 7.8, and 8.0, as described 
above in section a). Approximately 100 fmoles of the double-stranded 157 bp 
fragment of exon 4 of the tyrosinase gene (SEQ ID NO:40; prepared as described m 
Example 10) were placed in 200 ul thin walled centrifuge tubes (BioRad, Richmond, 
CA) in a total volume of 6.25 ul. The tubes were heated to 95°C for 15 seconds and 
cooled to 45°C. The clevage reactions were started by the addition of 3.75 ul of 
diluted enzyme mix containing 2.7 X CFLP™ buffer, pH 7.5 (to yield a final 
concentration of 1 X), 0.53 mM MnCl 2 (to yield a final concentration of 0.2 mM 
MnCl 2 ), 0.5 p.1 of Cleavase™ BN [50 ng/ul in 1 X dilution buffer (0.5% NP40, 0.5% 
TWEEN 20, 20 mM Tris-HCl, pH 8.0, 50 mM KC1, 10 u,g/ml BSA)]. 

The cleavage reactions were incubated for 5 minutes and then were terminated 
by the addition of 8 ul of stop buffer (95% formamide, 10 mM EDTA, 0.05% 
bromophenol blue, 0.05% xylene cyanol). 

Samples were heated to 72°C for 2 minutes and 4 ul of each reaction were 
resolved by electrophoresis through a 10% polyacrylamide gel (19:1 cross-link), with 
7M urea, in a buffer containing 45 mM Tris-Borate, P H 8.3, 1.4 mM EDTA. 
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After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane, as described in Example 10b. The DNA was transferred to the membrane 
and the membrane was dried, washed in IX I-BLOCK Blocking Buffer, washed and 
exposed to X-ray film as described in Example 22a, except that the dialled water 
5 washes were omitted. The resulting autoradiography are shown m Figure 65, panels A 

and B. 

In Figure 65, panel A, lanes 1 and 2 contain cleavage products from reactions 
run in a buffer at pH 8.2 (lane 1 contains the cleavage reaction; lane 2 is the uncut 
control). Lanes 3 and 4 contain cleavage products from reactions run in a buffer at pH 
10 7 2 (lane 3 contains the cleavage reaction; lane 4 is the uncut control). In panel B, 

lanes 1 and 2 contain cleavage products from reactions run in a buffer at pH 7.5 (lane 
1 is the uncut control; lane 2 contains the cleavage reaction). Lanes 3 and 4 contain 
cleavage products from reactions run in a buffer at pH 7.8 (lane 3 contains the uncut 
control- lane 4 contains the cleavage reaction). 

The results in Figure 65, panels A and B, demonstrate that the cleavage of 
double-stranded DNA was not sensitive to changes in pH over the range of buffer 
conditions tested. Because the cleavage of single-stranded DNA, however, was 
sensitive to changes in pH, the buffer conditions that were determined to be optimal 
for the single-stranded cleavage reaction were chosen for subsequent double-stranded 
20 cleavage experiments. 

EXAMPLE 26 

The Presence Of Competitor DNA Does Not Alter The Cleavage Pattern 

The effect of the presence of competitor (i.e., non-labelled substrate) DNA 
upon the cleavage reaction was examined. The cleavage reaction was run using the 
25 157 nucleotide fragment from the sense strand of the human tyrosinase gene (SEQ ID 

NO-47) and human genomic DNA. The results shown below demonstrate that the 
presence of non-substrate DNA has no effect on the CFLP™ pattern obtained m the 
cleavage reaction. 
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a) Preparation Of The Substrate DNA And The Cleavage 
Reactions 

The 157 nucleotide single- stranded wild type tyrosinase substrate (SEQ ID 
NO:47) containing a biotin label on the 5' end was prepared as described in Example 
11. Human genomic DNA (Promega) present at 235 ug/ml in Tris-HCl, pH 8.0; 1 
mM EDTA was ethanol precipitated and resuspended in Tris-HCl, pH 8.0; 0.1 mM 
EDTA to final concentration 400 ug/ml. This DNA was used as a competitor in 
standard CFLP™ single-stranded reactions (described in Example 11). Tyrosinase 
DNA substrate (SEQ ID NO:47) and human genomic DNA were mixed in H 2 0 in 
final volume of 6 ul. Samples were heated at 95°C for 10 seconds to denature the 
DNA, cooled to the target temperature of 65°C, and mixture of 2 ul 5X CFLP™ 
buffer, P H 7.5, 1 ul 10 mM MnCl 2 and 1 ul (25 ng) the enzyme Cleavase™ BN in 
dilution buffer was added. After 5 minutes at 65°C, 6 ul of stop buffer was added to 
terminate reaction and 5 ul of each sample was separated on a 10% denaturing 
polyacrylamide gel. Membrane transfer and DNA visualization were performed as 
described in Example 21. 

b) The Presence Of Genomic DNA Does Not Alter The 
CFLP™ Pattern 

Figure 66 shows the resulting pattern corresponding to the cleavage products of 
the sense strand of the wild type tyrosinase substrate (SEQ ID NO:47) in the presence 
of 0 ug/ml (lane 2), 20 ug/ml (lane 3), 40 ug/ml (lane 4), 80 ug/ml (lane 5), 120 
ug/ml (lane 6) and 200 ug/ml (lane 7) unlabeled human genomic DNA. Lane 1 shows 
an uncut control in the absence of the enzyme Cleavase™ BN and lane marked "M" 
contains the molecular weight markers prepared as described in Example 10. 

Figure 66 shows that the presence of genomic DNA in the cleavage reaction 
did not change either the position or the relative intensity of the product bands 
produced. Increasing the amount of nonspecific DNA in the reaction did, however, 
decrease the efficiency of the cleavage reaction and reduced the overall intensity of the 
pattern. These results can be explained by the binding of the Cleavase™ BN enzyme 
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to the nonspecific DNA which has the effect of decreasing the effective enzyme 
concentration in the reaction. This effect became significant when the concentrate of 
genomic DNA in the reaction was equal to or greater than 120 ng/ml [Figure 66 , 
lanes 6 (120 ug/ml) and 7 (200 ng/ml)]. Under these conditions, the genomic DNA 

5 was present at more than a 20,000-fold excess relative to the specific substrate DNA; 

nonetheless the CFLP™ pattern could still be recognized under these conditions. The 
observed stability of the CFLP™ pattern in the presence of genomic DNA ruled out 
the possibility that nonspecific DNA could significantly change the structure of the 
substrate DNA or alter the interaction of the Cleavase™ BN enzyme with the 

10 substrate. 

EXAMPLE 27 

The CFLP™ Reaction Can Be Practiced Using A Variety of Enzymes 

The above Examples demonstrated the ability of Cleavase™ BN, a 5' nuclease 
derived from Tag DNA polymerase, to generate a characteristic set of cleavage 
15 fragments from a nucleic acid substrate. The following experiments demonstrate that a 

number of other enzymes can be used to generate a set of cleavage products which are 
characteristic of a given nucleic acid. These enzymes are not limited to the class of 
enzymes characterized as 5' nucleases. 

a) Cleavage Patterns Generated by Other DNA 
20 Polymerases From The Genus Thermus 

To determine whether 5' nuclease activity associated with DNA polymerases 
(DNAPs) other than Tag DNAP could generate a distinct cleavage pattern from nucleic 
acid substrates, DNAPs from two species of Thermus were examined. The DNAP of 
Thermus flavus ["Tfl", Kaledm et al, Biokhimiya 46:1576 (1981); obtained from 
25 Promega Corp., Madison, WI] and the DNAP of Thermus thermophilus ["Tth", 

Carballeira et al, Biotechniques 9:276 (1990); Myers et al, Biochem. 30:7661 (1991); 
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obtained from U.S. Biochemicals, Cleveland, OH] were examined for their ability to 
generate suitable cleavage patterns (i.e., patterns which can be used to characterize a 
given nucleic acid substrate). 

The ability of these other enzymes to cleave nucleic acids in a structure-specific 
manner was tested using the single-stranded 157 nucleotide fragment of the sense 
strand of exon 4 of the tyrosinase gene (SEQ ID NO:47) under conditions reported to 
be optimal for the synthesis of DNA by each enzyme. 

Approximately 100 fmoles of the 157 nucleotide fragment derived from the 
sense strand of exon 4 of the tyrosinase gene (SEQ ID NO 47; prepared as described 
in example 10a) were placed in 200 ul thin wall microcentrifuge tubes (BioRad, 
Richmond, CA) in 1 X CFLP™ buffer, pH 8.2 and 1.33 mM MnCl 2 (to yield a final 
concentration of 1 mM) and KC1 to yield a final concentration of either 0 or 50 mM. 
Final reaction volumes were 20 ul. Samples were heated to 95°C for 5 seconds and 
then cooled to 65°C. A 20 ul no salt, no enzyme control was set up in parallel, with 
the differences that sterile distilled water was substituted for Cleavase™ BN and all 
reaction components were added prior to denaturation at 95°C. 

The cleavage reactions were started by the addition of 5 ul of a diluted enzyme 
mix containing either 1.25 units or 5 units of the indicated enzyme (see below) in 1 X 
CFLP™ buffer, pH 8.2. After 5 minutes, reactions were stopped by the addition of 16 
Hi of stop buffer (95% formamide, 10 mM EDTA, 0.05% bromophenol blue, 0.05% 
xylene cyanol). 

Samples were heated to 72°C for 2 minutes and 7 ul (in the case of the 
samples digested with Tfl) or 5 ul (in the case of the samples digested with Tth) were 
electrophoresed through a 10% polyacrylamide gel (19:1 cross-link), with 7M urea, in 
a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA, as described in 
Example 10a. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane, as described in Example 10a. The DNA was transferred to the membrane 
and the membrane was dried, blocked in 1 X I-BLOCK (Tropix, Bedford, MA), 
conjugated with streptavidin- alkaline phosphatase (United States Biochemical, 
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Cleveland, OH), washed, reacted with CDP-STAR™ (Tropix, Bedford, MA), and 
exposed to X-ray film as described in Example 22a, except that the distilled water 
washes were omitted. The results are presented in Figures 67 and 68. 

In Figure 67, lane 1 contains the no enzyme control and indicates the migration 
of the uncut DNA. Lanes 2-5 contain cleavage products derived from reactions 
incubated with Tfl DNAP. The reactions represented in lane 2 and 3 each contained 5 
units of Tfl DNAP; the sample in lane 2 was incubated in a reaction buffer containing 
0 mM KCl, while the sample in lane 3 was incubated in a reaction buffer containing 
50 mM KCl. The reactions in lanes 4 and 5 each contained 1.25 units of Tfl DNAP; 
the sample in lane 4 was incubated in a reaction buffer containing 0 mM KCl; that in 
lane 5 was incubated in a reaction buffer containing 50 mM KCl. 

In Figure 68, lanes 1 and 2 each contain cleavage products derived from 
reactions incubated with 1.25 units of Tth DNAP. The sample in lane 1 was incubated 
m a reaction buffer containing 0 mM KCl; that in lane 2 was incubated in a reaction 
buffer containing 50 mM KCl. Lanes 3 and 4 contain cleavage products derived from 
reactions incubated with 5 units of Tth DNAP. The sample shown in lane 3 was 
incubated in a reaction buffer containing 0 mM KCl; that in lane 4 was incubated in a 
reaction buffer containing 50 mM KCl. 

Figures 67 and 68 demonstrates that both Tth DNAP and Tfl DNAP display 
structure specific endonuclease activity similar in nature to that seen in the Cleavase™ 
BN enzyme. A comparison of the results shown in Figures 67 and 68 showed that the 
Tth DNAP was more efficient at generating a cleavage pattern under the reaction 
conditions tested. Comparison of the cleavage patterns generated by Tth DNAP with 
those generated by the Cleavase™ BN enzyme the indicates that essentially the same 
structures are recognized by these two enzymes [compare Figure 69, lane 2 
(Cleavase™ BN) with Figure 68 (Tth DNAP)]. 
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b) Enzymes Characterized As 3' Nucleases Can be Used 
To Generate Distinct Clevage Patterns 

To determine whether enzymes possessing 3' nucleolytic activity could also 
generate a distinct cleavage pattern, enzymes other than DNAPs (winch possess 5' 
5 nuclease activity) were tested in the cleavage reaction. Exonuclease III from 

Escherichia coli (E coli Exo III) was tested in a cleavage reaction using the 157 
nucleotide fragment prepared from the sense strand of exon 4 of the tyrosinase gen 
(SEQ ID NO:47). As a comparison, a reaction containing this substrate (SEQ ID 
NO-47) and Cleavase™ BN was also prepared. 
10 Approximately 100 fmoles of the 157 nucleotide fragment prepared from the 

sense strand of exon 4 of the tyrosinase gene (SEQ ID NO:47; prepared as described 
in Example 10a) were placed in 200 jil thin wall microcentrifuge tubes (BioRad, 
Richmond, CA) in 1 X CFLP™ buffer, pH 8.2 and 1.33 mM MnCl 2 (to yield a final 
concentration of 1 mM) and KC1 to yield a final concentration of either 0 or 50 mM m 
15 a volume of 15 ul. Final reaction volumes were 20 ul. 

The samples were heated to 95°C for 5 seconds and then rapidly cooled to 
37°C. A 20 ul no salt, no enzyme control was set up in parallel, with the differences 
that sterile distilled water was substituted for Cleavase™ BN and all reaction 
components were added prior to denaturation at 95°C 

A reaction tube containing 100 fmoles of the 157 nucleotide fragment (SEQ ID 
NO:47) and 50 ng of Cleavase™ BN in a buffer containing 0 mM KC1 was prepared 
and treated as described in Example 23 (i.e., denatured by incubation at 95°C for 5 
seconds followed by cooling to 65°C and the addition of the enzyme and incubation at 

65°C for 5 minutes). 

The cleavage reactions were started by the addition of 5 jxl of a diluted enzyme 
mix containing either 1.25 units or 200 units of Exo III (United States Biochemical, 
Cleveland, OH) in 1 X CFLP™ buffer, P H 8.2 (without MnCl 2 ) were added to the 15 
Hi reactions, and the reactions were incubated for 5 minutes. After 5 minutes at 37°C, 
the reactions were stopped by the addition of 16 ,1 of stop buffer (95% formamide, 10 
30 mM EDTA, 0.05% bromophenol blue, 0.05% xylene cyanol). 
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The samples were heated to 72°C for 2 minutes and 5 ul were electrophoresed 
through a 10% polyacrylamide gel (19:1 cross-link), with 7M urea, in a buffer 
containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA, as described in Example 10a. 
After electrophoresis, the gel plates were separated and overlaid with a nylon 
5 membrane, as described in example 10a. The DNA was transferred to the membrane 

and the membrane was dried, blocked in 1 X I-BLOCK (Tropix, Bedford, MA), 
conjugated with streptavidin-alkaline phosphatase (United States Biochemical, 
Cleveland, OH), washed, reacted with CDP-STAR™ (Tropix, Bedford, MA), and 
exposed to X-ray film as described in Example 22a, except that the distilled water 
10 washes were omitted. The results are presented in Figure 69. 

Lane 1 in Figure 69 contains the no enzyme control and indicates the mobility 
of the uncut DNA. Lane 2 contains cleavage fragments generated by incubation of the 
substrate with Cleavase™ BN enzyme and provides a comparison of the patterns 
generated by the two different enzymes. Lanes 3-6 contain cleavage fragments 
15 generated by incubation of the substrate with Exo III. Lanes 3 and 4 each contain 

reaction products generated in reactions which contained 200 units of Exo III; the 
reaction in lane 3 was run in a buffer containing 0 mM KCl, that in lane 4 was run in 
a buffer containing 50 mM KCl. Lanes 5 and 6 each contain reaction products 
generated in reactions which contained 1.25 units of Exo III; the reaction in lane 5 was 
20 run in a buffer containing 0 mM KCl, that in lane 6 was run in a buffer containing 50 

mM KCl. 

The results presented in Figure 69 demonstrate that Exo III generated a distinct 
cleavage pattern when incubated with a single- stranded DNA substrate. The pattern 
generated by Exo III was entirely distinct from that generated by the Cleavase™ BN 
25 enzyme. The results shown in Figure 69 also show that significant differences in the 

cleavage pattern generated by Exo III were observed depending on the concentrations 
of both the enzyme and KCl included in the reactions. 



231 




c) Ability Of Alternative Enzymes To Identify Single 
Base Changes 

In sections and a) and b) above it was shown that enzymes other than 
Cleavase™ BN could generate a distinct pattern of cleavage fragments when incubated 

5 in the presence of a nucleic acid substrate. Because both Tth DNAP and E. coli Exo 

III generated distinct cleavage patterns on single-stranded DNA, the ability of these 
enzymes to detect single base changes present in DNA substrates of the same size was 
examined. As in Example 11, the human tyrosinase gene was chosen as a model 
system because numerous single point mutations have been identified in exon 4 of this 

10 gene. 

Three single-stranded substrate DNAs were prepared; all three substrates 
contained a biotin label at their 5' end. The wild type substrate comprises the 157 
nucleotide fragment from the sense strand of the human tyrosinase gene (SEQ ID 
NO:47). Two mutation-containing substrates were used. The 419 substrate (SEQ ID 
NO:54) and the 422 substrate (SEQ ID NO:55), both of which are described in 
Example 11. Single-stranded DNA containing a biotin label at the 5' end was 
generated for each substrate using asymmetric PCR as described in Example 10a with 
the exception that the single-stranded PCR products were recovered from the gel rather 
than the double-stranded products. 

Cleavage reactions were performed as follows. Each substrate DNA 
(approximately 100 fmoles) was placed in a 200 ul thin wall microcentrifuge tubes 
(BioRad, Richmond, CA) in 5 p.1 of IX CFLP™ buffer with 1.33 mM MnCl 2 (to yield 
a final concentration of 1 mM). A no enzyme control was set up with the wild type 
DNA fragment in parallel and incubated at 65°C for each of the indicated time points, 
with the differences that sterile distilled water was substituted for Cleavase™ BN and 
all reaction components were added prior to denaturation at 95°C. The reaction tubes 
were brought to 95°C for 5 seconds to denature the substrates and then the tubes were 
quickly cooled to 65°C for the reactions containing Tth DNAP and 37°C for the 
reactions containing Exo III. 
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Cleavage reactions were started immediately by the addition of a diluted 
enzyme rmxture contaming 1.25 units of the enzyme either Tth DNAP or Exo III in 5 
Hi of IX CFLP™ buffer without MnCl 2 . The enzyme solution was brought to room 
temperature before addition to the cleavage reaction. After 5 minutes at 65°C, the 
reactions were stopped by the addition of 8 ,1 of stop buffer (95% formarmde, 10 mM 
EDTA 0.05O/O bromophenol blue, 0.05% xylene cyanol). The samples were heated to 
72°C for 2 minutes and 7 p.1 of each reaction were resolved by electrophoresis through 
a 10% polyacrylamide gel (19:1 cross-link), with 7 M urea, in a buffer containing 45 
mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated and overlaid with a nylon 
membrane, as described in example 10a. The DNA was transferred to the membrane 
and the membrane was dried, blocked in 1 X I-BLOCK (Tropix, Bedford, MA), 
conjugated with streptavidin-alkaline phosphatase (United States Biochemical), washed, 
reacted with CDP-Star™ (Tropix, Bedford, MA), and exposed to X-ray film as 
described in Example 22a with the exception that the distilled water washes were 
omitted. The results are presented in Figure 70. 

In Figure 70, lanes 1-3 contain cleavage fragments generated by incubation of 
either the wild-type, mutant 419 and mutant 422 alleles of the tyrosinase gene, 
respectively, with Tth DNAP. Lanes 4-6 contain cleavage fragments generated by 
incubation of either the wild type, mutant 419 and mutant 422 substrates, respectively, 
with Exo III in a buffer containing 0 mM KC1. Lanes 7-9 contain cleavage fragments 
generated by incubation of either the wild type, mutant 419 and mutant 422 substrates, 
respectively, incubated with Exo III in a buffer containing 50 mM KC1. Lane 10 
contains cleavage fragments generated by incubation of the wild type DNA substrate 
with Cleavase™ BN in a buffer containing 0 mM KC1; this reaction provides a 
comparison of the patterns generated by the three different enzymes (i.e., Cleavase™ 
BN, Tth DNAP and Exo III). Lane 11 contains the no enzyme control with the wild 
type DNA substrate incubated in the presence of 50 mM KC1. 



- 233 - 



10 



15 



The results shown in Figure 70 demonstrate that both Tth DNAP and Exo III 
were able to detect single base changes in a single- stranded DNA substrate relative to 
a wild-type DNA substrate. The patterns generated by Tth DNAP were comparable to 
those generated by Cleavase™ BN for all three DNA substrates (See Figure 32 for a 
comparison of the pattern generated by Cleavase™ BN). 

The patterns generated by Exo III were entirely distinct from those generated 
by enzymes derived from the genus Thermus ( f .c, Cleavase™ BN and Tth DNAP). 
Furthermore, the pattern produced by cleavage of the DNA substrates by Exo III were 
distinct depending on which concentration of KCl was employed in the reaction 
(Figure 70). A distinct pattern change was evident for the 419 mutant at both KCl 
concentrations. As shown in Figure 70, at 0 mM KCl, a band appears in the 40 
nucleotide range in the 419 mutant (lane 5); at 50 mM KCl, the 419 mutant contains 
an additional band in the 70 nucleotide range (lane 8). Pattern changes were not 
discernable for the 422 mutant (relative to the wild-type) in the Exo III digestions; this 
difference in the ability of the E. coU Exo III enzyme to detect single base changes 
could relate to the relative positions of the changes with respect to secondary structures 
that act as substrates for the structure specific cleavage reaction, and the position of the 
label (5' or 3' end) relative to the preferred cleavage site (5' or 3'), Figure 71. 



20 



25 



d) The Drosophila Rrpl Enzyme Can Be Used to 
Generate Cleavage Patterns 

Another protein in the Exo III family of DNA repair endonucleases, Rrpl from 
DrosophUa melanogaster (Nugent, M, Huang, S.-M., and Sander, M. Biochemistry, 
1993: 32, pp. 11445-11452), was tested for its ability to generate a distinct cleavage 
pattern on a single-stranded DNA template. Because its characteristics in the cleavage 
assay were unknown, this enzyme was tested under a variety of buffer conditions. 
Varying amounts of this enzyme (1 ng or 30 ng) were incubated with approximately 
100 fmoles of the 157 nucleotide fragment 3of the sense strand of exon 4 of the 
tyrosinase gene (SEQ ID NO: 47) in either 1 mM MnCl 2 or 5 mM MgCl 2 and either 
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1 X CFLP™ buffer, pH 8.2 or 1 X CFLP™ buffer, pH 7.8, with 10 mM NaCl. 
Samples were heated to 95*C and begun by the addition of a diluted enzyme mix 
containmg either 1 or 30 ng of RrpI in 1 X CFLP™ buffer. Reactions were earned 
out at 30°C for either 5 or 30 minutes. The results (data not shown) indicated that this 
enzyme generates a weak, but distinct cleavage pattern on a single-stranded DNA 
template. 

e) The Radl/RadlO Complex Can Be Used To Generate 
Cleavage Patterns 

The Radl-RadlO endonuclease (Radl/10) from S. cerevisiae is a specific 3' 
endonuclease which participates in nucleotide excision repair in yeast. This enzyme is 
a heterodimer consisting of two proteins, Radl and RadlO. Radl and RadlO alone do 
not have enzymatic activity. Radl/10 recognizes structures comprising a bifurcated 
DNA duplex and cleaves the single- stranded 3' arm at the end of the duplex 
[Bardwell, A.J et al. (1994) Science 265:2082]. In this sense Radl/10 shares the same 
substrate specificity as does the Cleavase™ BN enzyme. However, the cleavage 
products produced by Radl/10 and Cleavase™ BN differ as the Radl/10 cleaves on 
the 3' single-stranded arm of the duplex while Cleavase™ BN cuts on the 5' single- 
stranded arm. 

Figure 71 provides a schematic drawing depicting the site of cleavage by these 
two enzymes on a bifurcated DNA duplex (formed by the hairpin structure shown). In 
Figure 71, the hairpin structure at the top shows the site of cleavage by a 5' nuclease 
(eg., Cleavase™ BN). The hairpin structure shown at the bottom of Figure 71 shows 
the site of cleavage by an enzyme which cleaves at the 3' single- stranded arm (e.g., 
Radl/10). Enzymes which cleave on the 5' single- stranded arm are referred to as 
Cleavase™ 5' enzymes; enzymes which cleave on the 3' single- stranded arm are 
referred to as Cleavase™ 3' enzymes. 

In order to determine whether the Radl/10 protein is able to detect single base 
changes in DNA substrates, the cleavage patterns created by cleavage of DNA 
substrates by the Radl/10 and Cleavase™ BN enzymes were compared. In this 
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comparison the following substrates were used. The 157 nucleotide fragment from the 
wild type (SEQ ID NO:47), the 419 mutant (SEQ ID NO:54) and the 422 mutant 
(SEQ ID NO:55) alleles derived from the sense strand of exon 4 of the human 
tyrosinase gene was generated containing a biotin label at the 5' end as described in 

5 Example 11. 

The Radl and RadlO proteins were generously provided by Dr. Errol C. 
Friedberg (The University of Texas Southwestern Medical Center, Dallas). The 
Radl/10 complex was prepared by mixing Radl and RadlO proteins in IX dilution 
buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tris-HCl, pH 8.0, 50 mM KC1, 10 
10 Kig/ml BSA) to achieve a final concentration of 0.1 mM of each protein. 

Cleavage reactions using the Radl/10 endonuclease were performed as follows. 
The substrate DNA and 15 ng (0.1 pmole) of Radl/10 complex in 1 M l of IX dilution 
buffer were mixed on ice in 10 ul of IX CFLP™ buffer pH 7.8, 1 mM MnCl 2 . The 
reaction was then incubated at 37°C for 5 minutes. The cleavage reaction was stopped 
15 by addition of 6 ul of stop buffer. 

Cleavage reactions using the Cleavase™ BN enzyme were done exactly as 
described above for the Radl/10 cleavages with the exception that 10 ng of the 
Cleavase™ BN enzyme was added and the incubation at 37°C was performed for 3 
minutes. Uncut or no enzyme controls were run for each substrate DNA and were 
20 prepared as described for the reactions containing enzyme with the exception that 

sterile water was added in place of the enzyme (data not shown). 

The cleavage products (3 ul each) were separated by electrophoresis through a 
10% denaturing polyacrylamide gel, transferred to a membrane and visualized as 
described in Example 21. The resulting autoradiograph is shown in Figure 72. 
25 Figure 72 shows the resulting patterns corresponding to the cleavage products 

of the sense strand of the wild type tyrosinase substrate (SEQ ID NO:47) (lanes 1 and 
4), the 419 mutant (SEQ ID NO:54) (lanes 2 and 5) and the 422 mutant (SEQ ID 
NO:55) (lanes 3 and 6). Lanes 1-3 show the cleavage pattern created by incubation of 
the three substrate DNAs with the Cleavase™ BN enzyme and lanes 4-6 show 
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deavage patterns created by incubation of the three substrate DNAs with the RadlflO 
enzyme. Lanes marked "M" contatn molecular weight markers prepared as desenbed 
in Example 10. 

The results shown in Figure 72 demonstrate that the Radl/10 enzyme was able 
to produce distinctive cleavage patterns from the substrate DNAs (lanes 4-6); the 
average product length produced by cleavage of the substrate was longer than that 
produced by Cleavase™ BN. Importantly, the results shown in Ftgure 72 demonstrate 
that the single base substitutions found in the mutant tyrosinase substrates resulted m 
the production of specific changes in the otherwrse similar cleavage patterns of 
tyrosinase substrates (compare lanes 5 and 6 with lane 4). Note that in the digestton 
of the mutant 419 substrate with Radl/10, the bands below about 40 nucleotides have 
lower intensity and one band ,s absent, when compared to wild-type, wh.le m the 
digest of the mutant 422 substrate several new bands appear in the range of 42-80 
nuc.eot.de, Since both enzymes were tested usmg the same reaction conditions, these 
results show that Radl/10 was able to detect the same differences in DNA secondary 
structure that were recogmzed by Cleavase™ BN. Radl/10 generates a afferent 
cleavage pattern relative to that produced by Cleavase™ BN, smce cleavage takes 
place at the 3' end of DNA hairpins producing inherently longer fragments when the 
substrate contains a 5' end label. 

EXAMPLE 28 

Detection Of Mutations In The Human p-Globin 
Gene Using Double-Stranded DNA Substrates 

The results shown in Example 15 demonstrated that single base changes in 
fragments of the P -globin gene can be detected by cleavage of smgle-stranded DNA 
substrates with the Cleavase™ BN enzyme. In tins example it is shown that mutations 
in the P-globm gene can be detected by cleavage of double-stranded DNA substrates 
using the Cleavase™ BN enzyme. 
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Double-stranded substrate DNA comprising 536 bp fragments derived from the 
wild-type p-globm gene (SEQ ID NO:69), mutant 1 (SEQ ID NO:71) and mutant 2 
(SEQ ID NO:72) were generated containing a 5' biotin label on the sense strand using 
the PCR. PCR amplification of these substrates was done as described in Example 
15a. Gel purification and isolation of double- stranded fragments was performed as 

described in Example 21a. 

The cleavage reactions were performed as described in Example 21c. Briefly, 
2 ul of stock DNA (80 ng) in 10 mM Tris-HCl pH 8.0, 0.1 mM EDTA was mixed 
with 3 ^1 H 2 0 and denatured at 95°C for 20 seconds. The denatured DNA was cooled 
to 70°C and a mixture consisting of 2 ul of 5X CFLP™ buffer pH 7.5, 2 ul of 2 mM 
MnCl 2 and 1 ul (25 ng) of the enzyme Cleavase™ BN in dilution buffer was added to 
start the cleavage reaction. The cleavage reactions were stopped after 1 minute by the 
addition of 6 ul of stop buffer. Control uncut reactions were performed as described 
above with the exception that of 1 ul of H 2 0 was used in place of 1 ul of the 
Cleavase™ BN enzyme. The cleavage products (5 ul each) were separated by 
electrophoresis through a 6% denaturing poly aery lamide gel, transferred to a 
membrane and visualized as described in Example 21. The resulting autoradiograph is 

shown in Figure 73. 

Figure 73 shows the cleavage patterns which correspond to the cleavage of the 
sense strand of the wild type P -globin 536 bp fragment (lane 4), mutant 1 fragment 
(lane 5) and mutant 2 fragment (lane 6). Lanes 1-3 show the uncut controls for wild- 
type, mutant 1 and mutant 2 substrates, respectively. The lane marked "M" contains 
biotinylated molecular weight markers prepared as described in Example 10. 

As shown in Figure 73, the base substitution present in mutant 1 results in a 
reduction in the intensity of a band which migrates close to the uncut DNA (lane 5), 
when compared to wild-type cleavage pattern. The base substitution present in mutant 
2 results in the disappearance of the band present in the region just above major 
product band (approximately 174 nucleotides), when compared to the wild-type 
cleavage pattern. 
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For the double-stranded cleavage reactions described above, different reaction 
conditions were used than those employed for the cleavage of the single- stranded P- 
globin DNA substrates described in Example 15. The conditions employed for the 
cleavage of the double- stranded substrates used a lower MnCl 2 concentration, no KC1 
was added, a higher temperature and shorter time course relative to the conditions used 
in Example 15. Although the cleavage patterns generated by cleavage of the double- 
stranded and single-stranded p-globin DNA were slightly different, the positions of the 
pattern changes for mutants 1 and 2 are similar to those demonstrated in Example 15, 
and it was possible to detect the base substitutions in both double-stranded cases. 
These results show that the subtle changes in DNA secondary structure caused by 
single base substitutions in larger DNA substrates can be detected by the Cleavase™ 
BN enzyme whether a single- or double-stranded form of the DNA substrate is 
employed. 

EXAMPLE 29 

Identification Of Mutations In The Human p-Globin Gene CFLP™ 
Patterns Of Unknowns By Comparison To An Existing Library of Patterns 



The results shown in Examples 15 demonstrated that Cleavase™ BN enzyme 
generates a unique pattern of cleavage products from each p-globin substrate tested. 
Differences in banding patterns were seen between the wild-type and each mutant; 
20 different banding patterns were seen for each mutant allowing not only a 

discrimination of the mutants from the wild-type but also a discrimination of each 
mutant from the others. To demonstrate that the products of the Cleavase™ reaction 
can be compared to previously characterized mutants for purposes of identification and 
classification, a second set of p-globin mutants were characterized and the CFLP™ 
patterns, by comparison to the set analyzed in Example 15, were used to determine if 
the mutants in the second set were the same as any in the first set, or were unique to 
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the second set. Although these isolates have all been described previously (specific 
references are cited for of these isolates at the end of this example), the experiment 
was performed "blind", with the samples identified only by a number. 

Five p-globin mutants were compared to the CFLP™ patterns from the first 

5 set: the wild type (5-globin gene (SEQ ID NO:69) or mutant 1 (SEQ ID NO:71), 

mutant 2 (SEQ ID NO:72)or mutant 3 (SEQ ID NO:70). Plasmids for containing 
these 5 new isolates were grown and purified, and single- stranded substrate DNA, 534 
or 536 nucleotides in length, was prepared for each of the 5 (3-globin genes as 
described above in Example 15a. Cleavage reactions were performed and reaction 

0 products were resolved as described in Example 15; the resulting autoradiograph is 

shown in Figure 74. 

In Figure 74, two panels are shown. Panel A shows the reaction products from 
the p-globin isolates described in Example 15 (and as seen in Figure 43). Panel B 
shows the reaction products of the five additional isolates, numbered 4, 5, 6, 7 and 8. 
15 The lanes marked "M" contain biotmylated molecular weight markers prepared as 

described in Example 10. 

By comparison to the CFLP™ patterns shown in Panel A, the isolates shown in 
Panel B can be characterized. It can be seen that the banding pattern of isolate 4 
(Panel B, lanel) is the same as was seen for the wild-type p-globin substrate shown in 

20 Panel A (lane 1); isolate 8 (Panel B, lane 5) is comparable to the previously 

characterized mutant 3 (Panel A, lane 4); isolate number 6 (Panel B, lane3) has 
changes in two areas of the pattern and appears to have features of both isolates 2 
(Panel A, lane 3)and 3 (Panel A, lane 4); isolates 5 and 7 (Panel B, lanes 2 and4, 
respectively) appear to be identical, and they show a pattern not seen in panel A. 

25 To confirm the relationships between the different isolates, the identities of the 

mutations were then determined by primer extension sequencing using thefmole™ 
DNA Sequencing System (Promega Corp., Madison, WI) using the PCR primers [5'- 
biotinylated KM29 primer (SEQ ID NO:67) and 5 '-biotmylated RS42 primer (SEQ ID 
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NO:68)], according to the manufacturer's protocol. The sequencing reactions were 
visualized by the same procedures used for the (3-globin CFLP™ reactions, as 

described in Example 15b. 

The two isolates that matched members of the original set by CFLP™ pattern 
analysis matched by sequence also. Isolate 4 is identical to the wild type sequence 
(SEQ ID NO: 69); isolate 8 is a duplicate of mutant 3 (SEQ ID NO: 70). 

Isolate 6 appears by CFLP™ pattern to have changes similar to both mutant 2 
and mutant 3 of the original set. The sequence of mutant 6 (SEQ ID NO:82) reveals 
that it shares a one base change with mutant 3, a silent C to T substitution in codon 3. 
Mutant 6 also has a G-to-A substitution in codon 26, only 4 bases downstream of that 
found in mutant 2 (SEQ ID NO: 72). This mutation has been shown to enhance a 
cryptic splice site causing a fraction of the mRNA to encode a nonfunctional protein 
[Orkin, S.H., et al. (1982) Nature, 300:768]. It is worthy of note that while mutant 6 
and mutant 2 both showed alteration in the band that migrates at about 200 nucleotides 
(e.g., the band is missing or weak in mutant 2 but appears to be split into 3 weak 
bands in mutant 6) these changes are not of identical appearance. These CFLP™ 
changes, caused by mutations four nucleotides apart, are distinguishable from each 
other. 

The last two isolates, 5 and 7, had the same sequence (SEQ ID NO:83), and 
revealed a single base substitution within the first intron, at IVS position 110. This 
mutation is associated with abnormal splicing leading to premature termination of 
translation of the p-globin protein [R.A. Spritz et al. (1981) Proc. Natl. Acad. Sci. 
USA, 78:2455]. It is worthy of note that the band that disappears in the CFLP™ 
patterns for these mutants (at approximately 260 nucleotides, as compared to the size 
markers) is between the indicative bands in the mutant 1 (at approximately 400 
nucleotides) and mutant 2 (at approximately 200 nucleotides) CFLP™ patterns, and the 
actual mutation (at nucleotide 334 from the labeled 5' end) is between those of 
mutants 1 and 2, at nucleotides 380 and 207, respectively. Thus, the CFLP™ analysis 
not only indicated the presence of a change, but also gave positional information as 
30 well. 
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From the results shown in Figure 74, the unique pattern of cleavage products 
generated by Cleavase™ BN from each of the first four (wild type plus three variants) 
P-globin substrates tested was used as reference to characterize additional P-globin 
isolates. The banding patterns show an overall "familial" similarity, with subtle 
differences {e.g., missing or shifted bands) associated with each particular variant. 
Differences in banding patterns were seen between the wild-type and each mutant; 
different banding patterns were seen for each mutant allowing not only a 
discrimination of the mutant from the wild-type but also a discrimination of each 
mutant from the others. 

EXAMPLE 30 

Effect Of The Order Of Addition Of The Reaction 
Components On The Double- Stranded Cleavage Pattern 

The cleavage reaction using a double-stranded DNA substrate can be considered 
a two-step process. The first step is the denaturation of the DNA substrate and the 
15 second step is the initiation of the cleavage reaction at the target temperature. As it is 

possible that the resulting cleavage pattern may differ depending on the conditions 
present during denaturation {e.g., whether the DNA is denatured in water or in a 
buffer) as well as on the conditions of reaction initiation {e.g., whether the cleavage 
reaction is started by the addition of enzyme or MnCl 2 ) the following experiment was 

20 performed. 

To study the effect of the addition of the reaction components on the resulting 
cleavage pattern, all possible mixing combinations for 4 reaction components (i.e., 
DNA, CFLP™ buffer, MnCl 2 and the Cleavase™ BN enzyme) were varied. A single 
DNA substrate was used which comprised the 536 bp fragment derived from the wild- 

25 type P-globin gene (SEQ ID NO:69). The substrate DNA contained a biotin label at 

the 5' end of the sense strand and was prepared as described in Example 28. 
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The substrate was cut in 8 different cleavage reactions which employed 
different combinations for the addition of the reaction components at the denaturing 
and initiation steps. These reactions are described below. 

Figure 75 shows the resulting patterns generated by cleavage of the sense strand 
of the wild-type P-globin 536-bp substrate (SEQ ID NO:69). In lane 1, the substrate 
DNA (40 fmoles of DNA in 1 ul of 10 mM Tris-HCl, P H 8.0, 0.1 mM EDTA mixed 
with 5 ul H 2 0) was denatured at 95°C for 10 seconds, cooled to 55°C and the reaction 
was started by the addition of a mixture containing 2 ul of 5X CFLP™ buffer with 
150 mM KC1, 1 ul of 2 mM MnCl 2 and 1 ul (50 ng) of the Cleavase™ BN enzyme. 
In lane 2, the DNA was denatured in the presence of 2 ul of 5X CFLP™ buffer and 
reaction was started at 55°C by the addition of 1 ul MnCl 2 and 1 ul (50 ng) of the 
Cleavase™ BN enzyme. In lane 3, the DNA was denatured in the presence of MnCl 2 
and the reaction was started with addition of the buffer and the enzyme. In lane 4, the 
denaturation mixture included the substrate DNA and the enzyme and the reaction was 
started with addition of the buffer and MnCl 2 . In lane 5, the substrate DNA was 
denatured in the presence of CFLP™ buffer and MnCl 2 and then the enzyme was 
added at 55°C. In lane 6, the substrate DNA was denatured in the presence of 
CFLP™ buffer and the enzyme and then MnCl 2 was added at 55°C. Lane 7 shows the 
uncut control. In lane 8, the DNA was denatured in the presence of the enzyme and 
MnCl 2 and then the buffer was added at 55°C. In lane 9, the substrate DNA was 
denatured in the presence of the enzyme, MnCl 2 and the CFLP™ buffer and then the 
mixture was incubated at 55°C for 5 minutes. The lane marked "M" contains 
biotinylated molecular weight markers prepared as described in Example 10. 

In all cases reaction was stopped by addition of 6 ul of stop buffer. The 
25 reaction products (5 ul each) were resolved by electrophoresis through a 10% 

denaturing polyacrylamide gel and the DNA was transferred to a membrane and 
visualized as described in Example 21. The resulting autoradiograph is shown in 
Figure 75. 



15 



20 



- 243 - 



10 



15 



The results shown in Figure 75 demonstrate that most of denaturation-initiation 
protocols employed generated identical cleavage patterns with the exception of the 
reaction shown in lane 3. In the reaction shown in lane 3, the DNA was denatured m 
the presence of MnCl 2 and in the absence of CFLP™ buffer. In the cases where the 
enzyme and MnCl 2 were added before the denaturation step (lanes 8,9) no labeled 
material was detected. In these cases the label was released in a form of short DNA 
fragments which were produced as a result of nibbling (i.e., the exonucleolytic 
removal) of the label from the 5' end of the double-stranded DNA template. 

The results shown in Figure 75 demonstrate that the order of addition of the 
reaction components has little effect upon the cleavage pattern produced with the 
exception that 1) the DNA should not be denatured in the presence of MnCl 2 but in the 
absence of any buffering solution and 2) the Cleavase™ BN enzyme and MnCl 2 should 
not be added together to the DNA prior to the denaturation step. Under these two 
exceptional conditions, the 5' label was removed from the 5' end of the substrate by 
the enzyme resulting in a loss of the signal. 



EXAMPLE 31 

Detection Of Mutations In Human p53 Gene By 
Cleavase™ Fragment Length Polymorphism (CFLP™) Analysis 
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The results shown in preceeding examples demonstrated that the CFLP™ 
reaction could detect single base changes in fragments of varying size from the human 
P-globin and tyrosinase genes and that the CFLP™ reaction could be used to identify 
different strains of virus. The ability of the Cleavase™ reaction to detect single base 
changes in the human tumor suppressor gene P 53 was next examined. Mutation of the 
human p53 gene is the most common cancer-related genetic change; mutations in the 
p53 gene are found in about half of all cases of human cancer. 

The ability of the Cleavase™ BN enzyme to cleave DNA fragments derived 
from the human P 53 gene and to detect single base changes in fragments of the same 
size was examined. Plamsids containing cDNA clones containing either wild type or 
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mutant p53 sequences were used to generate templates for analysis in the CFLP™ 
reaction. The p53 gene is quite large, spanning 20,000 base pairs and is divuhed mto 
1 1 exons The use of a template derived from a cDNA allows for maximization of the 
amount of protein- encoding sequence that can be examined in a DNA fragment of a 
given size. 

The nucleotide sequence of the coding region of the wild type human P 53 
cDNA gene is listed in SEQ ID NO:92. The nucleotide sequence of the coding 
region of the mutant 143 human P 53 cDNA gene is listed in SEQ ID NO:93. The 
nucleotide sequence of the coding region of the mutant 249 (silent) human p53 cDNA 
gene is listed in SEQ ID NO:94. A 601 nucleotide fragment spanning exons 5 through 
8 was generated from each of these three P 53 cDNAs as follows. 

a) Preparation Of The Substrate DNA 

Six double stranded substrate DNAs were prepared for analysis in the 
CFLP™ reaction. The substrates contained a biotin label at either their 5' or 3' end. 
The wild type substrate comprises a 601 nucleotide fragment spanning exons 5 through 
8 of the cDNA sequence of the human P 53 gene (SEQ ID NO:92) [Baker, S. J. et ai, 
Science (1990) 249:912]. Two mutation containing substrates were used. The mutant 
143 substrate (SEQ ID:93) is derived from a P 53 mutant V143A which contains a 
valine (GTG) to alanine (GCG) substitution; this mutation differs from the wild type 
P 53 exon 5-8 fragment by a single nucleotide change [Baker, S. J. et al, Science 
(1990) 249-912]. The mutant 249 (silent) substrate is derived from a P 53 mutant 
which contains a single base change at amino acid 249, from AGG to AGA (SEQ ID 
NO:94). This single base change does not result in a corresponding ammo acid change 
and is therefore referred to as a silent mutation. 

The 601 bp double stranded PCR fragments were generated as follows. The 
primer pair 5 ' -TCTGGGCTTCTTGC ATTCTG (SEQ ID NO:95) and 5'- 
GTTGGGCAGTGCTCGCTTAG (SEQ ID NO:96) were used to prime the PCRs. The 
synthetic primers were obtained from Integrated DNA Technologies (Coralville, IA). 
The primers were biotinylated on their 5' ends with the Oligonucleotide Biotinylation 
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Ki, purchased from USB-Amersham (Cleve.and, OH) according to the manufacturers' 
protocol. When the sense strand was to be analysed in the CFLP™ reaction, the 
primer Hsted in SEQ !D NO:95 was iabeled at the 5' end with the biotin. When the 
anti-sense strand was to be analysed in the CFLP™ reaction, the primer hsted m SEQ 
ID NO-96 was labeled at the 5' end with the biotin. 

The target DNA used in the PCR for the generation of the 601 bp fragment 
derived from the wild type P 53 cDNA was the plasmid CMV- P 53-SN3 [Baker, S. J. 
* al supra]; ,h,s plasmid contams the coding region hsted in SEQ ID NO:92. The 
targei for the generation of the 601 bp fragment derived from the mutant 143 was the 
plasmid CMV-P53-SCX3 [Baker, S. J. e, a,., supra}; tins plasmid contams the codtng 
regton listed in SEQ ID NO:93. REF). The targe, for the generation of the 601 bp 
fragment derived from mutant 249 (silent) was the p.as m id LTR 273 His [Chen, P.-L. 
et al Science (1990) 250:1576); this plasm,d contains the coding region hsted m SEQ 
ID N0 94 DNA was prepared from bacteria harboring each plasmid (plasrmd DNA 
was isolated usmg standard techniques). The 601 bp PCR products were prepared as 
follows. 

The symmetric PCR reactions contained 50 ng of plasmid DNA, 50 pmoles 
primer 5 ' -TCTGGGCTTCTTGC ATTCTG (SEQ ID:95), 50 pmoles of primer 5'- 
GTTGGGCAGTGCTCGCTTAG (SEQ ID:96), 50 pM each dNTP, 20 mM Tris-Cl, 
p H 8 3 1.5 mM M g Cl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET 
P-40 (NP40) in a reaction volume of 95 ul. The reaction mixtures were overlatd w.th 
50 pi ChiUOut™ (MJ Research, Watertown, MA) and the tubes were heated to 95°C 
for 2 5 min Taa DNA polymerase (Promega Corp., Madison, WI) was then added as 
, 25 untts of enzyme in 5jU of 20 mM Tns-Cl, pH 8.3, 1.5 mM M g C 2 , 50 mM KC1, 
with 0.05% TWEEN 20 and 0.05% NONIDET P-40. The tubes were then heated to 
95»C for 45 seconds, cooled to 55°C for 45 seconds and heated to 72»C for 75 
seconds for 34 cycles with a 5 min incubation a. 72°C after the last cycle. 

The PCR products were gel purified as follows. The products were preempted 
by the addition of NaCl to a final concentration of 0.4M, 20 ug glycogen earner and 
500 p. ethanol. The DNA was pelleted by centnfugauon and the PCR products were 
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suspended in 25 or 50 pi sterile distilled water to which was added an equal volume 
of a solution containing 95% formamide, 20 mM EDTA and 0.05% each xylene 
cyanol and bromophenol blue. The tubes were then heated to 85°C for 2 min and the 
reaction products were resolved by electrophoresis through a 6% polyacrylim.de gel 
5 (19:1 cross-link) containing 7 M urea in a buffer containing 45 mM Tris-Borate, P H 
8 3 1 4 mM EDTA. The DNA was visualized by ethidium bromide staining and the 
Ml' bp fragments were excised from the ge. slices by passive diffusion overnight into 
a solution containing 0.5 M NH 4 OAc, 0.1% SDS and 0.1% EDTA. The DNA was 
then precipitated with ethanol in the presence of 4 ug of glycogen earner. The DNA 
,0 was pelleted, resuspended in stenle distilled water and reprecipita.ed by the addttton of 
NaCl to a final aqueous concentration of 0.2 M and 80% ethanol. After the second 
precipitation, the DNA was pelleted and resuspended in 30 ul sterile distilled water or 
TE (10 mM Tris-Cl, pH 8.0 and 0.1 mM EDTA). 

The nucleotide sequnence of these 601 bp templates are listed in SEQ ID 
15 NOS:97-102. The sense strand of the 601 nucleotide wild type fragment is listed in 
SEQ ID NO:97. The anti-sense strand of the 601 nucleotide wild type fragment is 
listed in SEQ ID NO:98. The sense strand of the 601 nucleotide mutant 143 fragment 
is listed in SEQ ID NO:99. The anti-sense strand of the 601 nucleotide mutant 143 
fragment is listed in SEQ ID NO:100. The sense strand of the 601 nucleotide mutant 
20 249 (silent) fragment is listed in SEQ ID NO:101. The anti-sense strand of the 601 

nucleotide mutant 249 (silent) fragment is listed in SEQ ID NO.102. 



25 



b) Cleavage Reaction Conditions 

Cleavage reactions comprised approximately 100 fmoles of the resulting double 
stranded substrate DNAs (the substrates contained a biotin moxety at the 5' end of the 
sense or antisense strand) in a total volume of 5 ul of stenle distilled water. The 
reactions were heated to 95°C for 15 seconds to denature the substrates and then 
quickly cooled to 50°C (this step allows the DNA to assume its unique secondary 
structure by allowing the formation of intra-strand hydrogen bonds between 
complimentary bases). 
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The reactions were performed in either a thermocycler (MJ Research, 
Watertown, MA) programmed to heat to 95°C for 15 seconds and then cooled 

immediately to 50° C. 

Once the tubes were cooled to the reaction temperature of 50°C, the followmg 
components were added: 5ul of a diluted enzyme mix contaming 0.2 ul of Cleavase™ 
BN [50ng/ul 1 X Cleavase™ Dilution Buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM 
Tris-C. P H 8.0, 50 mM KC>, 10 ug/m. BSA)]; 1 ul of 10 X CFLP™ reaction buffer 
(,00 mM MOPS, P H 7.5, 0.5% NP 40, 0.5% TWEEN 20), and 1 ul of 2mM MnCl, 

A no enzyme control (10 ul) was set up in parallel for each PCR fragment 
examined; tins control offered from the above reaction mixture only in that stenle 
distilled water was substituted for Cleavase™ BN enzyme. Reactions were stopped 
after 3 minutes by the addition of 8 ul of stop buffer (95% formamide, 10 mM EDTA, 
0 05% bromophenol blue, 0.05% xylene cyanol) . 

The samples were then heated to 85°C for 2 minutes and 4 ul of each react™ 
mixture were resolved by electrophoresis through a 6% polyacrylimide gel (19:1 cross- 
link), with 7M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM 
EDTA. 

After electrophoresis, the gel plates were separated allowing the gel to remain 
flat on one plate. A 0.2 um-pore positively charged nylon membrane (Schleicher and 
Schuell, Keene, NH), pre-wetted with 0.5X THE (45 mM Tris-Borate, pH 8.3, 1.4 mM 
EDTA) was laid on top of the exposed aerylamide gel. All air bubbles trapped 
between the gel and (he membrane were removed. Two pieces of 3 MM filter paper 
(Whatman) were then placed on top of the membrane, the other glass plate was 
replaced, and the sandwich was clamped with binder clips. Transfer was allowed to 
proceed overnight. After transfer, the membrane was carefully peeled from the gel 
and washed in IX Sequencase Images Blocking Buffer (United States Biochermcal) for 
two 15 minute intervals with gentle agitation. Three tenths of a ml of the buffer was 
used per cm' of membrane. A streptavidin-alkaline phosphatase conjugate (SAAP, 
United States Biochemical, Cleveland, OH) was added to a 1:3000 dilution directly to 
the blocking solution, and agitated for 15 minutes. The membrane was washed 3 
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rimes (5 min/wash) in 1 X SAAP buffer (lOOmM Tris-HCl, P H 10; 50 mM NaCl) 
with 0.1% SDS, using 0.5 mb/crf of membrane. The membrane was then washed 
twice in 1 X SAAP buffer without SDS, but containing 1 mM MgCl 2 , drained 
thoroughly and placed in a heat sealable bag. Using a stenle pip* tip, 0.05 mVcm* of 
CDP-Sta r ™ (Tropix, Bedford, MA) was added to the bag and dtstributed over the 
membrane for 5 minutes. The bag was dramed of all excess liquid and air bubbles. 
The membrane was then exposed to X-ray film (Kodak XRP) for an initial 30 mmute 
exposure. Exposure times were adjusted as necessary for resolution and c.anty. The 
result autoradiograph is shown in Figure 79. 

In Figure 79, the lane marked "M" contains biotinlyated molecular weight 
markers The marker fragments were purchased from Amersham (Arlington Heights, 
1L) Lanes 1-4 contain the reaction products from the incubation of double stranded 
DNA substrates in the absence of the Cleavase™ BN enzyme (i.e., uncut controls). 
Lane 1 contains the wild type fragment labeled on the sense strand of the 601 bp PCR 
fragment. Lane 2 contains the mutant 143 fragment labeled on the sense strand of the 
601 bp PCR fragment. Lane 3 contains the wild type fragment labeled on the 
antisense strand of the PCR product. Lane 4 contains the fragment encoding the 
silent mutation at amino actd 249 labeled on the a„t,sense strand of the PCR product. 
Lanes 5-8 contain the reaction products from the incubation of the 601 bp double 
stranded substrates with Cleavase™ BN enzyme. Lane 5 contains products generated 
using the wild type fragment labeled on the sense strand; lane 6 contains products 
generated using the mutant 143 labeled on the sense strand. Lanes 7 and 8 contatn 
products generated using the wild type and mutant 249 (silent) substrates, respectively, 

labeled on the anti- sense strand. 

The results shown in Figure 79 demonstrate that a similar, but distinctly 
different, pattern of cleavage products was generated by the digestion of wild type and 
mutant-containing templates by the Cleavase™ BN enzyme. Companson of lanes 5 
and 6 reveals a difference in the band pattern in the 100 nucleotide range. 
Specifically, the strong band present in the wild type (at around 100 nucleotides) was 
missing in the V143A mutant while two bands immediately below this strong band 
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W ere prominent in .he mutan, and not evident in the wild type. In the 200 nucleot.de 
range a pronounced double, seen in the wM ,ype is missing from .he mutan,, wh,ch 
i„s.ead confined a s.rong single band migrating slightly faster than the wild type 
double,. Sim.larly, comparison of lanes 7 and 8 revealed differences between .he 
5 pattern generated from cleavage of .he anti-sense strand of the wild type fragment and 
.he mu.an. 249 (si.en.) fragment. In the 100 nucleotide range, the wild type fragment 
exhib,ted a strong double, whereas .he upper band of this doublet was missing ,n the 
mutant 249 (silen.) fragment In addition, two prominent bands present in the w,ld 
type pattern in the 150-180 bp range were completely absent from .he mutant 249 

10 (silent) cleavage products. 

Although each mutan. fragment analyzed in Figure 79 differs from the wdd 
type by only one of the 601 nucleotides, a umque pattern of cleavage fragments was 
generated for each. Furthermore, at least one pattern change occured in each mutant 
in the immediate vicininty (i.e., within 10-20 nucleotides) of the DNA sequence 

15 change Th,s experiment demonstrates that CFLP™ is capable of dis.ingu.shmg the 
presence of single base changes in PGR fragments containing exons 5 through 8 of the 
p53 gene. 

EXAMPLE 32 

Detection Of Genetically Engineered 
Mutations In PCR Fragments Of The Human p53 Gene 

The ability of the Cleavase™ BN enzyme to detect single base changes 
generally engineered into PCR fragments containing exons 5 through 8 of the human 
P 53 gene was analyzed. The single base changes introduced were 1) a change from 
argmme (AGG) to serine (AGT) at ammo acid 249 (termed the R249S mutation) and 
2) a change from argmme (CGT) to histidine (CAT) at amino add 273 (termed the 
R273H mutation). Both of these mutations have been found in human tumors and 
have been identified as mutational hot spots [Hollstein et al, Science 253:49 (1991)]. 
The R249S mutation is strongly correlated with exposure to afiatoxin B and/or 
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,nfec,ic„ with hepatitus B virus [Caron de Fromenta. and Soussi, Genes, Chromosomes 
mi Cancer (1992) pp. 1-15]. The R273H mutation anses as a result of a transition at 
a CpG dimucleotide. Such transitions aecount for approximately one-third of the 
known P 53 mu.ationns and are characteristic of a variety of tumor types [Caron de 
Fromental and Soussi, supra; Hollstein et al, supra}. 

Plasmids containing the R249S and R273H mutations were engineered 
according to a variation of a protocol described by R. Higuchi [*■ PCR Tecknology: 
Principles ana ApplicaUons for DNA AmpUflcaiion, H. A. Ehrlich, Ed.(1989) Stockton 
Press NY pp 61-70]. This methodology allows the generation of collection of 
p,asmids containing DNA sequences correspondmg to known p53 mutations. The 
availability of this collection allows the generation of P 53 "bar code" library which 
contains the CFLP™ patterns generated by cleavage of the p53 mutants using the 

Cleavase™ enzymes. 

a) Construction of a 601 bp PCR fragment Containing the R249S 

Mutation 

To generate a 601 bp fragment containing the R249S mutation, a 2-step 
recombinant PCR was performed (see Figure 78 for a schematic representation of the 
2-ste P recombinant PCR). In the firs, or "upstream" PCR, oligonucleotides 5'- 
TCTGGGCTTCTTGCATTCTG (SEQ ID NO:95) and 5'-GAGGATGGGAC 
TCCGGTTCATG (SEQ ID NO:103) were used to amplify a 427 bp fragment 
containing the G to T base change resulting in the R249S mutation; the sequence of 
the 427 bp fragment is listed in SEQ ID NO: 1 1 1 . In the second or "downstream" 
PCR oligonucleotide 5'-CATGAACCGGAGTCCCATCCTCAC (SEQ ID NO.104) 
and 5'-GTTGGGCAGTGCTCGCTTAG (SEQ ID NO:96) were used to amplify a 196 
bp fragment containing the same base change on the complementary strand; the 
sequence of the 196 bp fragment is listed in SEQ ID NO:112. 

For each PCR, 10 ng of a cDNA clone encoding the wild type p53 gene 
(coding region listed in SEQ ID NO-.92) were used as the template in a 50 ul PCR 
reaction. In the case of the upstream fragment, 10 ng of template were added to a 
tube containing 5 picomolcs of the oligonucleonde 5 ' -TCTGGGCTTCTTGCATT CTG 
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CTG (SEQ ID NO:95), 5 pmoles of the oligonucleotide 5 -GAGGATGGGACTCC 
GGTTCATG (SEQ ID NO:103), 50 uM each dNTP, 20 mM Tris-Cl, pH 8.3, 1.5 mM 
MgC,,, 50 mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET P-40 (NP40) in a 
volume of 45 ul. For the downstream fragment, 10 ng of the wild type template, 
plasmid CMV-p53-SN3 (Example 31) were added to 5 picomoles of the 
oligonucleotide 5 -CATGAACCGGAGTCCCATCCTCAC (SEQ ID NO:104) and 5 
picomoles of the oligonudeotide 5 -GTTGGGCAGTGCTCGCTTAG (SEQ ID 
NO:96), 50 uM each dNTP, 20 mM Tris-Cl, pH 8.3, 1.5 mM MgCl 2 , 50 mM KC, 
with 0 05% TWEEN 20 and 0.05% NONIDET P-40 (NP-40). 

Tubes containing 45 ul of the above mixtures for each template to be amphfied 
were overlaid with 50 ul ChillOut™ (MJ Research, Watertown, MA) and the tubes 
were heated to 95»C for 2.5 min and then cooled to 70»C. Ta q DNA polymerase 
(Promega) was then added as 1.25 units of enzyme in 5ul of 20 mM Tris-Cl, P H 8.3, 
1 5 mM MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET P-40. 
The tubes were then heated to 95°C for 45 seconds, cooled to 55°C for 45 seconds and 
heated to 72°C for 75 seconds for 24 cycles with a 5 min incubation at 72°C after the 
last cycle. 

The PCR products were gel purified as follows. Ten microliters of each PCR 
product were mixed with 10 ,1 of stop buffer (95% formamide, 10 mM EDTA, 0.05% 
bromophenol blue, 0.05% xylene cyanol). The tubes were then heated to 85°C for 2 
min and the reaction products were resolved by electrophoresis through a 6% 
polyacryhrmde ge, (19:1 cross-hnk) containing 7 M urea in a buffer containing 45 mM 
Tris-Borate, pH 8.3, 1.4 mM EDTA (the polyacrylimide solutions used were freshly 
prepared) The DNA was visualized by ethidium bromide staining and the fragment 
was excised from the gel slice by passive diffusion overnight ,nto a solution containing 
0 5 M NH.OAc, 0.1% SDS and 0.1% EDTA at 37° C. 

Ten microliters of each eluted PCR product were combined to serve as the 
recombinant template to prime a second round of PCR. To this template, 10 
picomoles of 5 -biotin exon 8 primer (SEQ ID NO:96), 10 pmoles of 5 -exon 5 
primer (SEQ ID NO:95), 50 ,U each dNTP, 20 mM Tr,s-Cl, P H 8.3, 1.5 mM MgCl 2 , 
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50 mM KC1, with 0.05% TWEEN 20 and 0.05% NONIDET P-40 (NP-40) were 
added Tubes containing 90 nl of the above mixtures for each template to be 
amplified were overlaid with 50 ul ChillOut™ (MJ Research, Watertown, MA) and 
the tubes were heated to 95°C for 2.5 min and then cooled to 70°C. Tag DNA 
polymerase (Promega) was then added as 2.5 units of enzyme in 5 ul of 20 mM Tns- 
Cl P H 8.3, 1.5 mM MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% 
NONIDET P-40. The tubes were then heated to 95°C for 45 seconds, cooled to 47°C 
to allow the two template molecules to anneal, then heated to 72° C to allow extensron 
of the primers by Tag DNA polymerase. Following this initial cycle of denaturation, 
annealing and extension, 25 cycles in which the reactions were heated to 95°C for 45 
seconds, cooled to 55°C for 45 seconds, and then heated to 72°C for 1 minute were 
carried out, followed by a 5 min extension at 72' C. The fragments were then ethanol 
precipitated and gel purified as described in Example 31. 

b) Construction of a 601 bp PCR Fragment Containing the R273H 
Mutation 

To generate a 601 bp fragment containing the R273H mutation, a 2-step 
recombinant PCR was performed using the procedure described in section a) was used 
to simultaneously amplify PCR fragments encoding a single base change from arginine 
(CGT) to histidme (CAT) at amino acid 273. In the first or "upstream" PCR, 
ohgonucleotkle 5 ' -TCTGGGCTTCTTGC ATTCTG-3 ' (SEQ ID NO:95) and 5'- 
GCACAAACATGCACCTCAAAGCT-3' (SEQ ID NO:105) were used to generate the 
498 bp fragment whose sequence is listed in SEQ ID NO:113. In the second or 
"downstream" PCR, oligonucleotide 5 ' -C AGCTTTGAGGTGC ATGTTTGT-3 ' (SEQ 
ID NO:106) was paired with oligonucleotide 5 ' -GTTGGGCAGTGCTCGCTT AG-3 ' 
(SEQ ID NO:96) to generate a 127 nucleotide fragment whose sequence is listed in 
SEQ ID NO:114. The DNA fragments were electrophoreses eluted, combined and 
used to prime a second round of PCR as described in section a) to generate a 601 bp 
PCR product containing the R273H mutation. 
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c) Sequence Analysis of the 601 Nucleotide PCR Fragments 

The recombinant 601-bp PCR products generated through this two step PCR 
procedure were gel purified as described in Example 31. The PCR products were 
sequenced using the FMOL® DNA Sequencing System (Promega) in conjunction with 
oligonucleotide 5 ' -biotin-GTTGGGC AGTGCTCGCTT AG (SEQ ID NO:96) according 
to manufacturers' standard protocols to verify the presence of the engineered 
mutations. 

The nucleotide sequence corresponding to the sense strand of the 601 
nucleotide R249S mutant fragment is listed in SEQ ID NO:107. The anti-sense strand 
of the 601 nucleotide R249S mutant fragment is listed in SEQ ID NO:108. The sense 
strand of the 601 nucleotide R273H mutant fragment is listed in SEQ ID NO:109. 
The anti-sense strand of the 601 nucleotide R273H mutant fragment is listed m SEQ 
ID NO: 110. 
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d) Cleavage Reactions 

In order to generate ample quantities of DNA for subsequent CFLP™ analysis, 
the 601 bp fragments conatming either the R249S or the R273H mutation were used as 
templates in an additional round of PCR. Approximately 2 FMOLes of each 601 bp 
fragment were added to 20 pmoles of the primers corresponding to SEQ ID NOS:95 
and 96 (SEQ ID NO:96 contained a biotin on the 5' end), 50 uM each dNTP, 20 mM 
Tris-HCl, pH 8.3, 1.5 mM MgCl 2 , 50 mM KC1, 0.05% TWEEN 20 and 0.05% NP40. 
Tubes containing 90 ul of the above mixture were assembled for each template to be 
amplified; the tubes were overlaid with 50 ul ChillOut™ (MJ Research, Watertown, 
MA) and the tubes were heated to 95°C for 2.5 min and then cooled to 70°C. Tag 
DNA polymerase (Promega) was then added as 2.5 units of enzyme in 5 M l of 20 mM 
Tns-Cl, pH 8.3, 1.5 mM MgCl 2 , 50 mM KC1, with 0.05% TWEEN 20 and 0.05% 
NONIDET P-40 The tubes were then heated to 95°C for 45 seconds, cooled to 47°C 
to allow the two template molecules to anneal, then heated to 72° C to allow extension 
of the primers by Tag DNA polymerase. Following this initial cycle of denaturation, 
annealing and extension, 25 cycles in which the reactions were heated to 95°C for 45 
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seconds, cooled to 55°C for 45 seconds, and then heated to 72°C for 1 minute were 
carried out, followed by a 5 min extend at 72° C. The fragments were then ethanol 
precipitated and gel purified as described in Example 31. The gel purified fragments 
were then used in CFLP™ reactions as follows. 

Cleavage reactions comprised approximately 100 FMOLes of the resulting 
double stranded substrate DNAs (the substrates contained a biotin moiety at either the 
5' end of the sense or anti-sense strand) in a total volume of 5 ul (sterile distilled 
water was used to bring the volume to 5 ul). The reactions were heated to 95°C for 
15 seconds to denature the substrates and then quickly cooled to 50°C (this step allows 
the DNA to assume its unique secondary structure by allowing the formation of intra- 
strand hydrogen bonds between complimentary bases). The reaction were performed 
in either a thermocycler (MJ Research, Watertown, MA) programmed to heat to 95°C 
for 15 seconds and then cool immediately to 50° C or the tubes were placed manually 
in a heat block set at 95°C and then transferred to a second heat block set at 50°C. 

Once the tubes were cooled to the reaction temperature of 50°C, 5 ul of a 
diluted enzyme mix containing 0.2 ul of Cleavase™ BN enzyme [50 ng/ul 1 X 
Cleavase™ Dilution Buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tns-Cl, pH 8.0, 
50 mM KC1, 10 ug/ml BSA)], 1 ul of 10 X CFLP™ reaction buffer (100 mM MOPS, 
pH 7.5, 0.5% NP 40, 0.5% TWEEN 20), and 1 ul of 2 mM MnCl 2 . A 10 ul no 
enzyme control was set up in parallel for each PGR fragment examined in which 
sterile distilled water was substituted for Cleavase™ BN enzyme. After 2 minutes at 
50°C, the reactions were stopped by the addition of 8 ul of stop buffer (95% 
formamide, 10 mM EDTA, 0.05% bromophenol blue, 0.05% xylene cyanol) . 

The samples were heated to 85°C for 2 minutes and 7 ul of each reaction were 
resolved by electrophoresis through a 10% polyacrylimide gel (19:1 cross-link), with 
7M urea, in a buffer containing 45 mM Tns-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated allowing the gel to remain 
flat on one plate. A 0.2 um-pore positively charged nylon membrane (Schleicher and 
Schuell, Keene, NH), pre-wetted with 0.5X TBE (45 mM Tris-Borate, pH 8.3, 1.4 mM 
EDTA), was laid on top of the exposed acrylamide gel. All air bubbles trapped 
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between the gel and the membrane were removed. Two pieces of 3 MM filter paper 
(Whatman) were then placed on top of the membrane, the other glass plate was 
replaced, and the sandwich was clamped with binder clips. Transfer was allowed to 
proceed overnight. After transfer, the membrane was carefully peeled from the gel 
5 and washed in IX Sequencase Images Blocking Buffer (United States Biochemical) for 

two 15 minute intervals with gentle agitation. Three tenths of a ml of the buffer was 
used per cm 2 of membrane. A streptavidin-alkaline phosphatase conjugate (SAAP, 
United States Biochemical, Cleveland, OH) was added to a 1:3000 dilution directly to 
the blocking solution, and agitated for 15 minutes. The membrane was washed 3 
10 times (5 min/wash) in 1 X SAAP buffer (lOOmM Tris-HCl, pH 10; 50 mM NaCl) 

with 0.1% SDS, using 0.5 mls/cm 2 of membrane. The membrane was then washed 
twice in 1 X SAAP buffer without SDS, but containing ImM MgCl 2 , drained 
thoroughly and placed in a heat sealable bag. Using a sterile pipet tip, 0.05 ml/cm 2 of 
CDP-Star™ (Tropix, Bedford, MA) was added to the bag and distributed over the 
15 membrane for 5 minutes. The bag was drained of all excess liquid and air bubbles. 

The membrane was then exposed to X-ray film (Kodak XRP) for an initial 30 minute 
expsoure. Exposure times were adjusted as necessary for resolution and clarity. The 

results are shown in Figure 80. 

In Figure 80, the lane marked "M" contains biotinlyated molecular weight 

20 markers. The marker fragments were purchased from Amersham (Arlington Heights, 

IL) and include bands corresponding to lengths of 50, 100, 200, 300, 400, 500, 700, 
and 1000 nucleotides. Lanes 1-4 contain the reaction products from the incubation of 
double stranded DNA substrates labeled on the antisense strand in the absence of the 
Cleavase™ BN enzyme. Lane 1 contains the reaction products from the wild type 

25 fragment (SEQ ID NO:98); lane 2 contains the reaction products from the engineered 

R249S mutation (SEQ ID NO:108); lane 3 contains the reaction products from the 249 
(silent) mutation (SEQ ID NO: 102); lane 4 contains the reaction products from the 
engineered R273H mutation (SEQ ID NO: 110). 
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Lanes 5-8 contain the cleavage products generated from the sense strand each 
of these templates when incubated in the presence of the Cleavase™ BN enzyme. 
Lane 5 contains the cleavage products from the wild type fragment (SEQ ID NO:97); 
lane 6 contains the cleavage products from the R249S fragment (SEQ ID NO:107); 
5 lane 7 contains the cleavage products from the 249 (silent) mutant fragment (SEQ ID 

NO:101); lane 8 contains the cleavage products from the R273S fragment (SEQ ID 
NO: 109). 

The results shown in Figure 80 demonstrate that similar, but distinctly different, 
patterns of cleavage were generated from each of these templates containing single- 
10 base changes. Lane 6 shows the attenuation of bands in the 150-180 nucleotide range, 
as well as, the loss of a band in the 100 nucleotide range when compared to the wild- 
type pattern shown in lane 5. In addition, lane 6 shows a new band appearing in 
thel40 nucleotide range, and increased intensity in the top band of a doublet at about 
120 nucleotides. Examination of the silent 249 mutant (lane 7) which differs from 

15 wild-type at the same nucleotide position as R249S (lane 6), revealed pattern 

differences relative to both the wild type (lane 5) as well as to the R249S (lane 6) 
mutation. Specifically, comparison to lane 5 shows an attenuation of bands in the 
150-180 nucleotide range as well as the loss of a band in the 100 nucleotide range, as 
was seen in lane 6. However, the sample in lane 7 does not exhibit the additional 

20 band in the 140 nucleotide range, nor the increased intensity in the top band of the 

doublet in the 120 nucleotide range seen in lane 6. This result demonstrates that the 
CFLP™ technique is capable of distinguishing between changes to a different base at 

the same nucleotide position. 

Examination of the reaction products in lane 8 reveals the loss of a band in the 
25 100 nucleotide range in the R273S frgament when compared to the wild-type pattern 

in lane 5. This CFLP™ pattern is distinct from those in lanes 6 and 7, however, m 
that it does not show attenuation of bands in the 150-180 nucleotide range; in this 
region of the gel this pattern is essentially indistinguishable from that generated from 
the wild type fragment. 
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The above results demonstrate that CFLP™ can be used to detect clinically 
significant mutations in the human p53. Further, these results indicate that the 
CFLP™ technique is sufficiently sensitive to distinguish different base changes at the 
same position from one another, as well as from wild type. In addition these results 
show that the 2-PCR technique can be used to generate a collection of PCR fragments 
containing known P 53 mutations; such a collection allows the generation of a p53 bar 
code library containing the CFLP™ patterns generated by different P 53 mutations. 

EXAMPLE 33 

Detection Of The Presence Of Wild 
Type And Mutant Sequences In Mixed Samples 

The ability of the CFLP™ reaction to detect the presence of different alleles of 
the same sized PCR fragments in a mixed sample, such as might be found in 
heterozygous or otherwise heterogenous tissue, samples was examined. 

PCR products containing a bitoin label on the sense strand were produced and 
purified as described m Example 31 for the wild type P 53 (SEQ ID NO:97) and 
mutant 143 (SEQ ID NO:99) 601-bp fragments. Aliquots of these samples were 
diluted to a final concentration of approximately 12.5 FMOLs/ul and mixed in 
different proportions to give a spectrum of ratios of wild type to mutant DNA. Four 
microliters of the diluted DNA samples, for an approximate total of 50 FMOLs of 
DNA in each sample, mixed in various combinations, were placed in microfuge tubes 
and heated to 95 °C for 15 seconds. The tubes were rapidly cooled to 50°C and 6 „1 
of a diluted enzyme mix containing 0.2 ,1 of Cleavase™ BN [50ng/^l 1 X Cleavase™ 
Dilution Buffer (0.5% NP40, 0.5% TWEEN 20, 20 mM Tns-Cl, P H 8.0, 50 mM KCl, 
10 „g/ml BSA)] , 1 nl of 10 X CFLP™ reaction buffer (lOOmM MOPS, pH 7.5, 0.5% 
NP 40, 0.5% TWEEN 20), and 1 ul of 2mM MnCl, A 10 ul no enzyme control was 
set up in parallel for each PCR fragment examined, with the difference that sterile 
distilled water was substituted for the Cleavase™ BN enzyme. After 1.5 minutes at 
50°C, the reactions were stopped by the addition of 8 ul of stop buffer (95% 

- 258 - 



c9 




10 



15 



20 



25 



formamide, 10 mM EDTA, 0.05% bromophenol blue, 0.05% xylene cyanol) . In 
addition, 4 ul of wild type only as well as 4 ul of V143A only were analyzed by the 
same method for comparison to the mixed samples. 

Samples were heated to 85°C for 2 minutes and 7 p.1 of each reaction were 
resolved by electrophoresis through a 10% polyacrylimide gel (19:1 cross-link), with 
7M urea, in a buffer containing 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. 

After electrophoresis, the gel plates were separated allowing the gel to remain 
flat on one plate. A 0.2 um-pore positively charged nylon membrane (Schleicher and 
Schuell, Keene, NH), pre-wetted with 0.5X THE (45 mM Tris-Borate, pH 8.3, 1.4 mM 
EDTA), was laid on top of the exposed acrylamide gel. All air bubbles trapped 
between the gel and the membrane were removed. Two pieces of 3MM filter paper 
(Whatman) were then placed on top of the membrane, the other glass plate was 
replaced, and the sandwich was clamped with binder clips. Transfer was allowed to 
proceed overnight. After transfer, the membrane was carefully peeled from the gel 
and washed in IX Sequencase Images Blocking Buffer (United States Biochemical) for 
two 15 minute intervals with gentle agitation. Three tenths of a ml of the buffer was 
used per cm 2 of membrane. A streptavidin- alkaline phosphatase conjugate (SAAP, 
United States Biochemical, Cleveland, OH) was added to a 1:3000 dilution directly to 
the blocking solution, and agitated for 15 minutes. The membrane was washed 3 
times (5 min/wash) in 1 X SAAP buffer (lOOmM Tris-HCl, pH 10; 50 mM NaCl) 
with 0.1% SDS, using 0.5 mls/cm 2 of membrane. The membrane was then washed 
twice in 1 X SAAP buffer without SDS, but containing ImM MgCl 2 , drained 
thoroughly and placed in a heat sealable bag. Using a sterile pipet tip, 0.05 ml/cm 2 of 
CDP-Star™ (Tropix, Bedford, MA) was added to the bag and distributed over the 
membrane for 5 minutes. The bag was drained of all excess liquid and air bubbles. 
The membrane was then exposed to X-ray film (Kodak XRP) for an initial 30 minute 
exposure. Exposure times were adjusted as necessary for resolution and clarity. The 
resulting autoradiograph is shown in Figure 81. 
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In Figure 81, the lane marked "M" contains biotinylated molecular weight 
markers obtained from Amersham (Arlington Heights, IL) and include bands 
corresponding to lengths of 50, 100, 200, 300, 400, 500, 700, and 1000 nucleotides. 
Lanes 1 and 2 contain the reaction products from the no enzyme controls for the wild 
5 type and V143A mutant fragments, respectively. Lane 3 contains cleavage products 

from the sample containing the wild type fragment only. Lane 4 contains cleavage 
products from wild type and mutant fragments mixed in a 1:1 ratio. Lane 5 contains 
cleavage products from a reaction containing a 1: 2 ratio of wild type to mutant 
fragment. Lane 6 contains reaction products present in a ratio of wild type to mutant 
10 of 1 :9. Lane 7 contains cleavage products from a sample containing V143A mutant 

DNA only. Lane 8 contains cleavage products mixed at a ratio of wild type to mutant 
of 2:1. Lane 9 contains cleavage products mixed at a ratio of wild type to mutant 4:1. 
Lane 10 contains cleavage products mixed a ratio of wild type to mutant to 9:1. 

The results shown in Figure 81 demonstrate that the presence of different 
alleles can be detected in a mixed sample. Comparison of lanes 4-6 and lanes 8-10 
with either lane 3 or lane 7 demonstrates that the lanes containing mixed reactions 
exhibit distinct differences from either sample alone. Specifically, in the 100 
nucleotide region, there is a doublet in the wild type sample that shifts in the mutant 
(see discussion of Figure 80 in Example 31). All three of these bands are present in 
20 the mixed samples (lanes 4-6 and lanes 8-10) whereas only one or the other pair is 

detectable in lanes 3 and 7. 
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EXAMPLE 34 

Detection and Identification of Hepatitis C Virus Genotypes By 
Cleavase™ Fragment Length Polymorphism Analysis 

Hepatitis C virus (HCV) infection is the predominant cause of post-transfusion 
non-A, non-B (NANB) hepatitis around the world. In additon, HCV is the major 
etiologic agent of hepatocellular carcinoma (HCC) and chronic liver disease world 
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wide. Molecular biological analysis of the small (9.4 kb) RNA genome has showed 
that some regions of the genome are very highly conserved between isolates, while 
other regions are subject to fairly rapid mutation. These analyses have allowed these 
viruses to be divided into six basic genotype groups, and then further classified into 
5 several sub-types [Altamirano et al, J. Infect. Dis. 171:1034 (1995)]. These viral 

groups are associated with different geographical areas, and and accurate identification 
of the agent in outbreaks is important in montoring the disease. While only genotype 
1 HCV has been observed in the United States, multiple HCV genotypes have been 
observed in both Europe and Japan. HCV genotype has also been associated with 
10 differential efficacy of treatment with interferon, with Group 1 infected individuals 

showing little response. The ability to identify the genotype of HCV present in an 
infected individual allows comparisons of the clinical outcomes from infection by the 
different types of HCV, and from infection by multiple types in a single individual. 
Pre-screening of infected individuals for the viral type will allow the clinician to make 
15 a more accurate diagnosis, and to avoid costly but fruitless drug treatment. 

In order to develop a rapid and accurate method of typing HCV present in 
infected individuals, the ability of the Cleavase™ reaction to detect and distinguish 
between the major genotypes and subtypes of HCV was examined. Plasmids 
containing DNA derived from the conserved 5' untranslated region of six different 
20 HCV RNA isolates were used to generate templates for analysis in the CFLP™ 

reaction. The HCV sequences contained within these six plasmids represent genotypes 
1 (four sub-types represented; la, lb, lc and Ale), 2 and 3. The nomenclature of the 
HCV genotypes used is that of Simmonds et al. [as described in Altamirano et al, 
supra]. 

25 a) Generation of Plasmids Containing HCV Sequences 

Six DNA fragments derived from HCV were generated by RT-PCR using RNA 
extracted from serum samples of blood donors; these PCR fragments were a gift of Dr. 
M. Altamirano (University of British Columbia, Vancouver). These PCR fragments 
represent HCV sequences derived from HCV genotypes la, lb, lc, Ale, 2c and 3a. 
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The RNA extraction, reverse transcription and PCR were performed using 
standard techniques [Altamirano et al., J. Infect. Dis. 171:1034 (1995)]. Briefly, RNA 
was extracted from 100 |*1 of serum using guanidine isothiocyanate, sodium lauryl 
sarkosate and phenol-chloroform [Inchauspe et al, Hepatology 14:595 (1991)]. 
Reverse transcription was performed according to the manufacturer's instructions using 
a GeneAmp rTh reverse transcriptase RNA PCR kit (Perkin-Elmer) in the presence of 
an external antisense primer, HCV342. The sequence of the HCV342 primer is 5'- 
GGTTTTTCTTTGAGGTTT AG-3 ' (SEQ ID NO:115). Following termination of the 
RT reaction, the sense primer HCV7 [5'-GCGACACTCCACCATAGAT-3' (SEQ ID 
NO: 116)] and magnesium were added and a first PCR was performed. Aliqouts of the 
first PCR products were used in the second (nested) PCR in the presence of primers 
HCV46 [5 ' -CTGTCTTC ACGC AGAAAGC-3 ' (SEQ ID NO:117)] and HCV308 [5'- 
GC ACGGTCT ACGAGACCTC-3 ' (SEQ ID NO.118)]. The PCRs produced a 281 bp 
product which corresponds to a conserved 5' noncodmg region (NCR) region of HCV 

between positions -284 and -4 of the HCV genome [Altramirano et al, J. Infect. Dis. 

171:1034 (1995)]. 

The six 281 bp PCR fragments were used directly for cloning or they were 
subjected to an additional amplification step using a 50 ul PCR comprising 
approximately 100 FMOLes of DNA, the HCV46 and HCV308 primers at 0.1 uM, 
100 uM of all four dNTPs and 2.5 units of Taq polymerase in a buffer containing 10 
mM Tris-HCl, P H 8.3, 50 mM KC1, 1.5 mM MgCl 2 and 0.1% TWEEN 20. The PCRs 
were cycled 25 times at 96°C for 45 sec, 55°C for 45 sec. and 72°C for 1 min. Two 
microliters of either the original DNA samples or the reamplified PCR products were 
used for cloning in the linear pT7Blue T-vector (Novagen, Madison,WI) according to 
manufacturer protocol. After the PCR products were ligated to the pT7Blue T-vector, 
the ligation reaction mixture was used to transform competent JM109 cells (Promega). 
Clones containing the pT7Blue T-vector with an insert were selected by the presence 
of colonies having a white color on LB plates containing 40 ^g/ml X-Gal, 40 jig/ml 
IPTG and 50 jig/ml ampicillin. Four colonies for each PCR sample were picked and 
grown overnight in 2 ml LB media containing 50 ng/ml carbenicillin. Plasmid DNA 
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was isolated using the following alkaline miniprep protocol. Cells from 1.5 ml of the 
overnight culture were collected by centrifugation for 2 min. in a microcentrifuge (14K 
rpm), the supernatant was discarded and the cell pellet was resuspended in 50 ul TE 
buffer with 10 u£/ml RNAse A (Pharmacia). One hundred microliters of a solution 
containing 0.2N NaOH, 1% SDS was added and the cells were lysed for 2 min. The 
lysate was gently mixed with 100 ul of 1.32 M potassium acetate, pH 4.8, and the 
mixture was centifugated for 4 min. in a microcentrifuge (14K rpm); the pellet 
comprising ell debris was discarded. Plasmid DNA was precipitated from the 
supernatant with 200 ul ethanol and pelleted by centrifugation a microcentrifuge (14K 
rpm). The DNA pellet was air dried fori 5 min. and was then redissolved in 50 ul TE 
buffer (10 mM Tris-HCl, pH 7.8, 1 mM EDTA). 

To analyze the cloned HCV inserts, 1 ul of plasmid DNA (approximately 10 to 
100 ng) reamplified in a 50 ul PCR using the HCV46 and HCV308 primers as 
described above with the exception that 30 cycles of amplification were employed. 
15 The PCR products were separated by electrophoresis on a 6% non-denaturing 

acrylamide gel (29:1 cross linked) in 0.5X TBE buffer; clones that gave rise to a 281 
bp PCR product were selected for further analysis. 

For sequencing purposes, plasmid DNA from selected clones was PEG purified 
as follows. To 50 ul of plasmid DNA in TE buffer (approximately 10-100 ng/ul), 25 
20 ul of 5M NaCl and 10 ul 20% PEG (M.W.8,000; Fisher) was added, mixed well, and 

the mixture was incubated on ice for 1 hour. The mixture was then centrifuged for 5 
min in a table-top microcentrifuge (at 14K rpm), the pellet was removed and an 
additional 15 ul of 20% PEG was added to the supernatant. After incubation for 1 
hour on ice, a second pellet was collected by centrifugation, the supernatant was 
25 discarded, and the pellet was redissolved in 20 ul H 2 0. Two microliters of PEG- 

purified plasmid DNA (approximately 100 ng) was used in cycle- sequencing reactions 
using the FMOL® DNA Sequencing System (Promega, Madison, WI) according to 
manufacturer protocol, in conjunction with the HCV46 or HCV308 primers. The 
HCV46 or HCV308 primers were biotinylated at the 5' end using Oligonucleotide 
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Biotin Labeling kit (Amersham, Arlington Heights, IL) prior to use in the sequencing 
reactions. Sequencing reactions were separated on 10% denaturing acrylamide gel, 
transferred on nylon membrane and visualized as described in Example 21. 

Alternatively, DNA sequencing was done using either the Blue-Tl [5'- 
5 GATCT ACTAGTCATATGGAT-3 ' (SEQ ID NO: 119)] and Blue-T2 [5'- 

TCGGT ACCCGGGGATCCGAT-3 ' (SEQ ID NO: 120)] primers labeled at the 5' end 
with tetra chloro fluorescein (TET) dye (Integrated DNA Technologies). In this case, 
the sequencing reactions were separated on a 10% denaturing acrylamide gel and the 
products were visualized using a FMBIO-100 Image Analyzer (Hitachi). The six HCV 
10 clones were termed HCV1.1, HCV2.1, HCV3.1, HCV4.2, HCV6.1 and HCV7.1; the 

double-stranded DNA sequence of these clones are listed in SEQ ID NOS:121-126, 
respectively. The sequence of the sense strand for each of the six HCV clones is 
shown as the top line in SEQ ID NOS:121-126. The sequence of the anti-sense strand 
for HCV clones HCV1.1, HCV2.1, HCV3.1, HCV4.2, HCV6.1 and HCV7.1 is listed 
15 in SEQ ID NOS: 127-132, respectively. 

The DNA sequences of each of the six HCV clones are aligned in Figure 82. 
In Figure 82, nucleotides which represent variations between the six HCV clones are 
indicated by bold type and underlining; dashes are used to indicate gaps introduced to 
maximize alignemnt between the sequences (necessary due to the insertion found in 
20 clone HCV4.2). This alignment shows that these six HCV clones represent six 

different HCV genotypes. HCV1.1 represents a genotype lc HCV; HCV2.1 represents 
a genotype la HCV; HCV3.1 represents a genotype lb HCV; HCV4.2 represents a 
genotype lc HCV; HCV6.1 represents a genotype 2c HCV and HCV7.1 represents a 
genotype 3a HCV. For one sample, HCV4.2, an insertion of an "G" nucleotide was 
25 found at position 146 (relative to the protypical HCV; Altamirano et al, supra), since 

no insertion or deletions in the HCV NCR have been previously reported, a second 
independent clone derived from the PCR products corresponding to HCV4 was 
sequenced. This second HCV4 clone was found to have the same sequence as that 
shown for HCV4.2 in Figure 82. 
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b) Preparation of HCV Substrates 

Six double stranded substrate DNA were prepared for analysis in the CFLP™ 
reaction. The substrates were labelled at the 5' end of either the sense or the anti- 
sense strand by the use of labeled primers in the PCR to permit CFLP™ analysis of 
5 each strand of the HCV DNA substrates. 

To prepare PCR products for CFLP™ analysis, the HCV46 and HCV308 
primers were 5' end labeled with TMR dye using the ONLY™ BODIPY® TMR 
Oligonucleotide Phosphate Labeling Kit (Molecular Probes, Inc., Eugene, OR) 
according to manufacturer protocol. All six HCV 281 bp NCR sequences were PCR 
10 amplified using 10 ng of template and 30 cycles of amplification as described above in 

section a). 

For sense strand analysis, the PCR was conducted using the HCV46 primer 
(SEQ ID NO: 117) labeled with TMR and unlabeled HCV308 primer (SEQ ID 
NO: 11 8). For antisense analysis, the PCR was conducted using unlabeled HCV46 
15 primer (SEQ ID NO:117) and HCV308 primer (SEQ ID NO:118) labeled with TMR. 

The PCR products were purified by electrophoresis on a 6% denaturing acrylamide gel 
and eluted overnight as described above in Example 21. The gel-purified DNA 
substrates were redissolved in 20 (il H 2 0 at an approximate concentration of 100 
FMOLes/ul. 

20 c) Cleavage Reaction Conditions 

Cleavage reactions comprised 1 ul of TMR-labeled PCR products 
(approximately 100 FMOLes of the double-stranded substrates) in a total volume of 10 
ul containing IX CFLP™ buffer (10 raM MOPS, pH 7.5; 0.5% each TWEEN 20 and 
NP-40) and 10 ng Cleavase™ BN enzyme. All components except the MnCl 2 were 
25 assembled in a volume of 8 ul. The reactions were heated to 95°C for 15 seconds to 

denature the substrates and then quickly cooled to 55°C. The reaction were performed 
in either a thermocycler (MJ Research, Watertown, MA) programmed to heat to 95°C 
for 15 seconds and then cool immediately to 55° C or the tubes were placed manually 
in a heat block set at 95°C and then transferred to a second heat block set at 55°C. 
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Once the tubes were cooled to the reaction temperature of 55°C, the cleavage 
reaction was started by the addition of 2 ul of 1 mM MnCl 2 . After 2 minutes at 55°C, 
the reactions were stopped by the addition of 5 jal of a solution containing 95% 
formamide, 10 mM EDTA and 0.02% methyl violet. 

Five microliters of each reaction mixture were heated at 85°C for 2 min, and 
where than resolved by electrophoresis through a 12% denaturing polyacrylamide gel 
(19:1 cross link) with 7M urea in a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 mM 
EDTA. The gels were run at 33 watts for 1.5 hours. The labeled reaction products 
were visualized using the FMBIO-100 Image Analyzer (Hitachi), with the resulting 
fluoroimager scan shown in Figure 83. 

In Figure 83, the CFLP™ patterns produced by cleavage of the six HCV 
samples labeled on the sense strand are shown in lanes 1-6; the CFLP™ patterns 
produced by cleavage of the six HCV samples labeled on the anti-sense strand are 
shown in lanes 7-12. The position of molecular weight markers is indicated on the 
left-hand side of the fluoroimager scan by the large arrowheads; the size of the 
markers is indicated in nucleotides. 

The experiment presented in Figure 83 demonstrates the ability of CFLP™ to 
differentiate six distinct hepatitis C viral subtypes. The six samples in the left hand 
side of the panel (lanes 1-6) were labeled on the 5' end of the sense strand; the six on 
the right (lanes 7-12), on the 5' end of the antisense strand. The first four samples in 
each set all contain samples amplified from HCV type 1. Subtypes a, b, and c are 
represented, as is a single base deletion of type lc (i.e., Ale). Analysis of either 
strand points out numerous similarities as well as several distinctive differences 
between the subtypes. Most notable among the similarities on the sense strand are 
prominent bands marked A, B and C. Specifically, whereas bands B and C are 
evident in the patterns generated from both subtypes la and lb (and are, in fact, more 
prominent in subtype lb than in la), they are barely visible in subtype lc. Band A, 
though present in all 4 of these samples, is more prominent in the patterns generated 
from subtypes lc and la. Differences between subtypes 2c and 3a vs. all of the 
subtype 1 samples, are evident in the region between 50 and 100 nt (compare bands D 
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and E) on the sense strand and between the 80 and 150 nt on the antisense strand 
(compare bands F-J). Viral type 2 gives rise to the most significantly altered CFLP 
pattern, while type 3 appears to be similar to type 1; these relationships appear to be 
consistent with the relative number of sequence differences between the different 
isolates 

The results shown in Figure 83 demonstrate that the CFLP™ method provides 
a simple and rapid method to determine the genotype of HCV strains. This method 
will facilitate the diagnosis of HCV infection, permit appropriate treatment of HCV- 
infected patients, and aid in the moinitoring of HCV outbreaks. 
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EXAMPLE 35 

Detection of Mutations Associated With Antiobiotic Resistance in Mycobacterium 

tuberculosis 



In the past decade there has been a tremendous resurgence in the incidence of 
tuberculosis in this country and throughout the world. Worldwide, the number of new 
15 cases reported annually is forecast to increase from 7.5 million in 1990 to 10.2 million 

by the year 2000. An alarming feature of this resurgence in tuberculosis is the 
increasing numbers of patients presenting with strains of M. tuberculosis which are 
rewistant to one or more antituberculosis drugs [i.e., multi-drug resistant tuberculosis 
(MDR-TB)]. 

20 Resistance to either or both of the antibiotics rifampin (rif) and isoniazid (inh) 

is the standard by which M. tuberculosis strains are judged to be multi-drug reistant. 
Both because of their potent bactericidal activities and because acquisition of primary 
resistance to these drugs is rare (the spontaneous mutation rate of resistance to 
rifampin is approximately 10 8 and to isoniazid, 10" 8 to 10*), until very recently, these 

25 two antibiotics were among the most powerful front-line drugs used to comabt the 

advance and spread of tuberculosis. However surveys of tuberculosis patients in the 
U.S. reveal that as many as one-third were infected with strains resistant to one or 
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more antituberculosis drugs; greater than 25% of the M. tuberculosis cultures isolated 
were resistant to isoniazid and 19% were resistant to both isoniazid and rifampin 
[Frieden et al, New Eng. J. Med. 328:521 (1993)]. 

As discussed above (Description of the Invention), reistance to rifampin is 

5 associated with mutation of the rpoB gene in M. tuberculosis. While the exact 

mechanism of resistance to isoniazid is not clear, the majority (as many as 80%) of 
inh r mutations occur in the katG and inhA genes of M tuberculosis. To investigate 
whether CFLP™ could be used to detect mutations in the genes involved in MDR-TB, 
DNA fragments were amplified from the rpoB and katG genes of M tuberculosis. 

10 DNA fragments derived from wild-type (i.e., antibiotic- sensitive) or mutant (i.e., 

antibiotic-resistant) strains of M. tuberculosis were subjected to CFLP™ analysis. 

a) CFLP™ Analysis of Mutations in the RpoB Gene of M. tuberculosis 
i) Generation of Plasmids Containing RpoB Gene 
Sequences 

15 Genomic DNA isolated from wild-type M. tuberculosis or M. tuberculosis 

strains containing mutations in the rpoB gene associated with rifampin resistance were 
obtained from Dr. T. Schinnick (Centers for Disase Control and Prevention, Atlanta, 
GA). The rifampin resistant strain #13 (91-3083) contains a tyrosine residue at codon 
451 of the rpoB gene in place of the histidine residue found in the wild-type strain 

20 (i.e., H451Y); this mutation is is present in 28% of rifampin resistant TB isolates. The 

H451Y mutation is hereinafter refered to as mutant 1. The rifampin resistant strain 
#56 (91-2763) contains a luecine residue at codon 456 of the rpoB gene in place of the 
serine residue found in the wild-type strain (i.e., S456L); this mutation is present in 
52% of rifampin resistant TB isolates. The S456L mutation is hereinafter refered to as 

25 mutant 2. 

A 620 bp region of the TB rpoB gene was amplified using the PCR from DNA 
derived from the wild-type and mutant 1 and mutant 2 strains. The primers used to 
amplifiy the rpoB gene sequences were PolB-5A [5 ' - ATC AAC ATCCGGCCGGTGGT- 
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3' (SEQ ID NO:133] and PolB-5B [5 ' -GGGGCCTCGCT ACGGACCAG-3 ' (SEQ ID 
NO:134)]; these PCR primers amplify a 620 bp region of the rpoB gene which spans 
both the H451Y and S456L mutations [Miller et al, Antimicrob. Agents Chemother., 
38 805 (1994)]- The PCRs were conducted in a final reaction volume of 50 ul 
containing the PolB-5A and PolB-5B primers at 1 \xM, 1.5 mM MgCl 2 , 20 mM Tris- 
HC1 P H 8.3, 50 mM KC1, 0.05 % each TWEEN-20 and NONIDET P-40 and 60 uM 
of all four dNTPs. The reaction mixture was heated at 95°C for 3 min. Amplification 
was started by the addition of 2.5 units of Tag polymerase and was continued for 35 
cycles at 95°C for 1 min, 60°C for 1 min and 72°C for 2 min. 

To clone the PCR-ampUfied fragments, 1 ul of each PCR product was used for 
Hgation in the linear pTVBlue T-vector (Novagen, Madison,WI). The ligation products 
were used to transform competent JM109 cells and clones containing P T7Blue T- 
vector with an insert were selected by white color on LB plates containing 40 u^/ml 
X-Gal, 40 p.g/ml IPTG and 50 ng/ml ampicillin. For each PCR sample (i.e., wild-type 
and mutants 1 and 2), five independent colonies were picked and grown overnight in 2 
ml of LB media containing 50 jxg/ml carbenicillm. Plasmid DNA was isolated using 
the alkaline miniprep protocol described above in Example 34. 

To analyze the cloned fragments, 1 ul of plasmid DNA from each clone was 
amplified by PCR using 50 ul reaction containing the PolB-5A and PolB-5B primers 
at 1 uM, 1.5 mM MgCl 2 , 20 mM Tris-HCl, pH 8.3, 50 mM KC1, 0.05 % each 
TWEEN-20 and NONIDET P-40, 60 uM of all 4 dNTPs and 2.5 units of Tag 
polymerase. The PCRs were cycled 35 times at 95°C fori min, 60°C for 1 min and 
72°C for 2 min. The PCR products were separated by electrophoresis on a 6% native 
polyacrylamide gel in 0.5X TBE buffer and clones that gave rise to a 620 bp fragment 
were selected for further analysis. 

For sequencing purposes, plasmid DNA from selected clones was PEG-purified 
as described in Example 34. Two microliters of PEG-purified plasmid DNA 
(approximately 100 ng) was used for cycle-sequencing with FMOL R kit (Promega, 
Madison, WI) in conjunction with the PolB-5A and PolB-5B primers containing a 
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biotin moity at the 5' end. Biotinylation of the primers was performed using an 
Oligonucleotide Biotin Labeling kit (Amersham). Sequencing reactions were separated 
in a 8% denaturing polyacrylamide gel, transferred to a nylon membrane and 
visualized as described above in Example 21. The DNA sequences of the 620 bp rpoB 
5 gene fragment derived from the wild-type, mutant 1 and mutant 2 strains are listed in 

SEQ ID NOS-.135-137. The sequence of the sense strand for each of the three TB 
strains is shown as the top line in SEQ ID NOS:135-137. The sequence of the anti- 
sense strand for the wild-type, mutant 1 and mutant 2 TB strains is listed in SEQ ID 

NOS:138-140, respectively. 
1Q ii) Preparation of M. tuberculosis rpoB Gene Substrates 

In order to generate substrates for use in CFLP™ reactions, the cloned 620 bp 
fragment derived from the wild type and mutants 1 and 2 rpoB gene were amplified 
using the PCR. The PCRs were conducted using one primer of the primer pair labeled 
at the 5' end so that the resulting PCR product would permit the analysis of either the 
1 5 sense or anti-sense strand of the rpoB gene fragmnets. In order to generate substrates 

labelled on the anti-sense strand, ten nanograms of plasmid DNA from the sequenced 
clones was used as the template in 50 ul reactions containing 1 jiM of each the PolB- 
5A primer (unlabelled) and PolB-5B primer biotinylated at the 5' end using 
Oligonucleotide Biotin Labeling kit (Amersham), 1.5 mM MgCl 2 , 20 mM Tris-HCl, 
20 P H 8.3, 50 mM KC1, 0.05 % each TWEEN-20 and NONIDET P-40, 60 uM of all 4 

dNTPs and 2.5 units of Tag polymerase. The reactions were cycled 35 times at 95°C 
for 1 min, 60°C for 1 min and 72°C for 2 min. The resulting 620 bp PCR products 
containing a biotin-labeled antisense strand were gel-purified as described in Example 
21. The purified fragments were dissolved in 20 |il H 2 0. 
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To generate substrates labelled on the sense strand of the 620 bp fragment of 
rpoB gene fragments (wild-type and mutats 1 and 2), the PCRs were conducted using 
1 uM each PolB-5A primer 5' end labeled with TMR dye using ONLY™ BODIPY R 
TMR Oligonucleotide Phosphate Labeling Kit (Molecular Probes, Inc., Eugene, OR) 

5 and unlabeled PolB-5B primer. The PCR reactions also contained 1 .5 mM MgCl 2 , 20 

mM Tris-HCl, pH 8.3, 50 mM KC1, 0.05 % each TWEEN-20 and NONIDET P-40, 60 
uM of all 4 dNTPs, 5 units of Tag polymerase and 10 ng of plasrmd DNA from the 
sequenced clones as a template in a final volume of 100 jd. The reasons were cycled 
for a total of 35 cycles comprising 95°C for 1 min, 60°C for 1 mm and 72°C for 2 

10 min. 

In addition to the above PCR conditions, the PCR reactions were also 
conducted using dUTP in place of dTTP to generate undine-containmg PCR 
fragments. Uridine-containing PCR fragments have become the standard type of PCR 
fragment analyzed in clinical laboratories. In order to demonstrate that undine- 
15 containing PCR fragmrnts can be used to produce distinct CFLP™ patterns from 

substrates which vary by a single base pan change within a 620 bp fragment, rpoB 
gene fragments containing a 5' TMR label on the sense strand and uridine in place of 
thymidine were generated as follows. Uridine-containing 620 bp fragments (wild-type 
and mutants 1 and 2) were amplified according to the PCR protocol described above 
20 for the generation of fragments labelled at the 5' end of the sense strand with TMR 

with the exception that 2.5 mM MgCl 2 was used in place of 1.5 mM MgCl 2 and 100 
uM dATP, 100 uM dCTP, 100 [iM dGTP and 200 uM dUTP were used in place of 
the mixture containing 60 »M each of all 4 dNTPs (,e., dATP, dCTP, dGTP and 
dTTP). 

25 The 620 bp PCR products containing a TMR-labeled sense strand (either 

uridine- or thymidine-containing) were purified in 6% denaturing gel as described 
above, eluted overnight, precipitated with ethanol and redissolved in 20 ul H 2 0 as 
described in Example 21, for a concentration of approximately 15 FMOLes/^1. 
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Hi) Cleavage Reaction Conditions 

Cleavage reaction conditions for analysis of the 620 bp rpoB fragments 
containing a biotin-labelled antisense strand were as follows. Six microliters of biotin 
labeled PCR product were combined with 1 ul of 10X CFLP™ buffer (100 mM 
MOPS, P H 7.5, 0.5% each TWEEN 20 and NP-40) and 25 ng Cleavase™ BN 
enzyme. Prior to the initiation of the cleavage reaction, the DNA mixtures were 
denatured by incubation at 95°C for 10 sec. The reactions were then cooled to 60°C 
and reaction was started by the addition of 1 ul of 2 mM MnCl, The cleavage 
reactions were conducted at 60°C for 2 min. Cleavage reactions were stopped after 2 
min. by adding 5 ul of STOP solution (95% formamide, 10 mM EDTA and 0.02% 
each bromphenol blue and xylene cyanol). Six microliters of each sample were 
resolved by electrophoresis on a 6% denaturing polyacrylamide gel and labeled 
fragments were visualized as described in Example 21. The resulting autoradiogram is 

shown in Figure 84. 

In Figure 84, the lane marked "M" contains biotinylated molecular weight 
markers obtained from Amersham (Arlington Heights, IL) and include bands 
corresponding to lengths of 200, 300, 400, 500 nucleotides. The size of the markers 
and of the uncleaved rpoB substrates (620) is indicated on the left-hand side of the 
autoradiograph using large arrowheads. Lanes 1-3 contain the reaction products 
generated by the cleavage of the mutant 1, wild-type and mutant 2 substrates labelled 
on the anti-sense strand, respectively. The distance of the point mutation (relative to 
the wild-type sequence) from the 5' end label was 511 nucleotides for the mutant 1 
substrate and 499 nucleotides for the mutant 2 substrate. 

The results shown in Figure 84 demonstrate that similar, but distinctly different 
patterns of cleavage were generated from the each of the rpoB substrates labelled on 
the anti-sense strand. In comparison with the cleavage pattern generated by the wild- 
type substrate, the pattern generated by cleavage of the mutant 1 substrate shows a 
disappearance of Band A. A comparison of the pattern generated by cleavage of the 
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wild-type and mutant 2 substrates shows that the mutant 2 substrate has a significant 
reduction of insensity of Band B. Thus, the two mutants can be distinguished from 
the wild-type and from each other. 

Cleavage reaction conditions for analysis of the 620 bp rpoB fragments 
5 containing a TMR-labelled sense strand were as follows. Four microliters of TMR- 

labeled PCR product were cleaved as described above. Cleavage reactions were 
stopped after 2 min. by adding 5 ul 95% formamide, 10 mM EDTA and 0.02% of 

methyl violet (Sigma). 

The reactions were heated to 85°C for 2 min. and five microliters of each 

10 reaction mixture were resolved by electrophoresis through a 12% denaturing 

polyacrylamide gel (19:1 cross link) with 7M urea in a buffer containing 45 mM Tris- 
Borate, pH 8.3, 1.4 mM EDTA. The gel was run at 33W (watts) for 1.5 hours. The 
labeled reaction products were visualized using the FMBIO-100 Image Analyzer 
(Hitachi) with the resulting fluoroimager scan shown in Figure 85, PAnel A. the gel 

15 was then electrophoresed for another 1 hour, and the second scan is shown in Panel 

B.. 

In Figure 85, two panels, A and B, are shown. Panel B represents a scan of 
the same gel shown in Panel A following a longer period of electrophoresis than that 
shown in Panel A. Thus, Panel B serves to spread out the banding pattern seen in the 
20 upper portion of Panel A (lines connecting Panels A and B show the region of 

expansion). In Figure 85, Panels A and B, lanes 1-4 contain the reaction products 
produced by cleavage of thymidine-containing substrates having a TMR-label on the 
sense strand derived from the mutant 1, wild-type, mutant 2 and a mixture of the wild- 
type and mutant 2 substrates, respectively. Lane 5 of Panels A and B contains the 157 
25 bp fragment derived from exon 4 of the tyrosinase gene (SEQ ID NO:40) labeled with 

TET as a marker. Lanes 6-9 of Panels A and B contain the reaction products 
produced by cleavage of uridine-containing substrates having a TMR-label on the 
sense strand derived from the mutant 1, wild-type, mutant 2 and a mixture of the wild- 
type and mutant 2 substrates, respectively. Mixtures of the wild-type and mutant 2 
30 substrates (lanes 4 and 9) were generated by mixting together 5 ul of each substrate 
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after the cleavage reaction; 6 p.1 of the mixture was then loaded on the gel. The 
distance of the point mutation (relative to the wild-type sequence) from the 5' end 
label was 100 nucleotides for the mutant 1 substrate and 116 nucleotides for the 
mutant 2 substrate. 

5 The results shown in Figure 85 demonstrate that similar, but distinctly different 

patterns of cleavage were generated from the each of the rpoB substrates labelled on 
the sense strand. The left hand set of each panel contains CFLP patterens generated 
from PCR products containing dNTPs, while the right hand side contains CFLP 
patterns generated from PCR products in which dUTP was substituted for dTTP. 
10 Comparision of the CFLP patterns generated from dNTP-containing amplicons of 

mutant 1 and wild-type reveals a marked reduction in intensity of a band 
approximately 80 nt from the labeled 5' end (band A), in the vicinity of the sequence 
change in this mutant (100 bp from the labeled 5' end). In addition, a band migrating 
at approximately 200-250 nt from the labeled 5' end (band B) is missing in mutant 1. 
15 In contrast, comparision of the patterns generated from wild-type and muatnt 2 reveals 

the loss of a band 120 nt from the labeled 5' end (band C). Furthermore, examination 
of the region of the gel corresponding to 120 nt shows, particularly in Panel B, that 
band D is shifted downward in mutant 2 relative to wild-type. In Panel B, another 
band, migrating just above band D (labeled band D') also appears to be shifted 
downward in mutant 2 relative to wild-type. Lane 4 of each panel, in which aliquots 
from the wild-type and mutant CFLP reactions were mixed prior to electrophoresis 
demonstrates that this shift (in band D') in mutant 2 is real and not due to an 

electrophoresis artifact. 

Examination of the CFLP patterns generated from the dUTP-containing 
25 amplicons demonstrates that the ability to distinguish these mutants from one another, 

as well as from the wt, is not adversely affected by substitution of dUTP for dTTP and 
may, in fact, be enhanced. In this example, both mutants 1 and 2 are more readily 
distinguished from the wt when the patterns are generated from amplicons containing 
dUTP than dTTP. In the right-hand portion of panel A, comparison of the lanes 
30 containing mutant 1 and wt reveals several distinctive differences between the two 
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amplicons, while others are new and unanticipated. Specifically, band A is reduced in 
intensity in the mutant, as compared to the wt, in much the same way that it is in the 
left-hand portion of this panel. A band migrating at approximately 1 10 nt (band E) 
appears to be missing from the mutant, as does a band at approximately 250 nt 

5 (compare to band B in the left-hand portion of the gel). In addition, the strong band 

labeled F, while not noticeably different in the three samples containing dTTP, is 
much stronger in the wt pattern generated from dUTP-containing amplicons than it is 
in the mutants. Comparison of the patterns generated from wt and mutant 2 also 
reveals a number of pronounced differences. Most notably, a band migrating at 

10 approximately 60 nt appears in mutant 2 (band G), as does a complex of 2 new bands 

migrating at approximately 150 nt (band H). Interestingly, while some of the elements 
that make each of these patterns distinct from one another are different if dUTP is 
substituted for dTTP in the PCR amplification, the vast majority of the cleavage 
fragments are identical in the two experiments. This result suggests that substitution 

15 of dUTP results in subtle alterations in the single-stranded DNA substrate which may 

be the result of altered stability of secondary structures or an altered affinity of 
Cleavase™ for secondary structures containing modified nucleotides. These 
differences in Cleavase ™-based recognition of secondary structures in DNA fragments 
containing dUTP provides an unexpected benefit of using this nucleotide substitution. 



20 b) CFLP™ Analysis of Mutations in the KatG Gene of M. tuberculosis 

i) Generation of Plasmids Containing KatG Gene Sequences 

Genomic DNA isolated from wild-type M. tuberculosis or M. tuberculosis 
strains conatining mutations in the katG gene associated with isoniazid resistance were 
obtained from Dr. J. Uhl (Mayo Clinic, Rochester, MN). These four strains are 
25 termed wild-type, S315T, R463L and S315T;R463L [Cockerill, III et al, J. Infect. Dis. 

171:240 (1995). Strain S315T contains a G to C mutation in codon 315 of the wild- 
type katG gene. Strain R463L contains a G to T mutation in codon 463 of the wild- 
type gene and strain S315T;R463L contains both the G to C mutation in codon 315 
and the G to T mutation in codon 463. 
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A 620 bp region of the M. tuberculosis katG gene was amplified using the PCR 
from DNA derived from the above four strains. The primers used to amplify the katG 
gene sequences were KatG904 [5 ' - AGCTCGT ATGGCACCGGAAC-3 ' (SEQ ID 
NO:141) and KatG1523 [5'-TTGACCTCCCACCCGACTTG-3' (SEQ ID NO:142)]; 
these primers amplify a 620 bp region of katG gene which spans both the S315T and 
R463L mutations. The PCRs were conducted in a final reaction volume of 100 ul and 
contained the KatG904 and KatG1523 primers at 0.5 uM, 1.5 mM MgCl 2 , 20 mM 
Tris-HCl, P H 8.3, 50 mM KC1, 0.05 % each TWEEN-20 and NONIDET P-40, 60 uM 
of all 4 dNTPs. The reaction mixtures were heated at 95°C for 3 min, then 
amplification was started with addition of 5 units of Taq polymerase and continued for 
35 cycles at 95°C for 1 min, 60°C for 1 min and 72°C for 2 min. 

To clone the PCR-amplified katG fragments, 1 ul of each PCR product was 
used for ligation into the linear P T7Blue T-vector (Novagen, Madison,WI). The 
ligation products were used to transform competent JM109 cells and clones containing 
pTTBlue T-vector with an insert were selected by white color on LB plates containing 
40 us/ml X-Gal, 40 u^/ml IPTG and 50 u^/ml ampicillin. For each of the four PCR 
samples, four colonies were picked and grown overnight in 2 ml LB media containing 
50 Mg/ml carbenicillin. Plasmid DNA was isolated using the alkaline miniprep 
protocol described in Example 34. 

To analyze the cloned katG fragments, 1 ul of plasmid DNA from each clone 
was amplified by PCR using 100 ul reactions containing the KatG904 and KatG1523 
primers at 0.5 uM, 1.5 mM MgC12, 20 mM Tris-HCl, P H 8.3, 50 mM KC1, 0.05 % 
each TWEEN-20 and NONIDET P-40, 60 uM of all 4 dNTPs and 5 units of Taq 
polymerase. The PCRs were cycled 35 times at 95°C fori min, 60°C for 1 min and 
72°C for 2 min. PCR products were separated by electrophoresis on a 6% native 
polyacrylamide gel in 0.5X TBE buffer and clones that gave rise to a 620 bp fragment 
were selected for further analysis. 
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For sequencing purposes, plasmid DNA from selected clones was PEG-purified 
according to the protocol described in Exmple 34. Two microliters of plasmid DNA 
(approximately lOOng) was used for cycle- sequencing with FMOL R kit (Promega, 
Madison, WI) in conjunction with the KatG904 and KatG1523 primers containing a 
5 biotm moity at the 5' end. Biotinylation of the primers was performed using an 

Oligonucleotide Biotin Labeling kit (Amersham). Sequencing reactions were separated 
in a 8% denaturing polyacrylamide gel, transferred to a nylon membrane and 
visualized as described above m Example 21. The DNA sequences of the 620 bp katG 
gene fragments from the wild-type and mutant strains S315T, R463L and 
10 S315T;R463L are listed in SEQ ID NOS:143-146, respectively. The sequence of the 

sense strand for each of the four katG gene fragments is shown as the top line in SEQ 
ID NOS:143-146, espectively. The sequence of the anti-sense strand of the 620 bp 
katG gene fragments from the wild-type and mutant strains S315T, R463L and 
S315T;R463L is listed in SEQ ID NOS:147-150, respectively. 
l5 ii) Preparation of M. tuberculosis KatG Gene Substrates 

In order to generate substrates for use in CFLP™ reactions, the cloned 620 bp 
fragments derived from the wild-type and S315T, R463L and S315T;R463L M. 
tuberculous strains were amplified using the PGR. The PCRs were conducted m a 
final reaction volume of 100 ul and conatined 0.5 uM each KatG904 and KatG1523 
20 primers, 1.5 mM M g Cl 2 , 20 mM Tns-HCl, pH 8.3, 50 mM KC1, 0.05 % each 

TWEEN-20 and NONIDET P-40, 60 uM of all 4 dNTPs, 5 units of Taq polymerase 
and 10 ng of plasmid DNA from the sequenced clones as a template. The reactions 
were cycled 35 times at 95°C for 1 min, 60°C for 1 mm and 72°C for 2 min. 

To obtain 620 bp PCR fragments of the katG gene having a biotin label on the 
25 sense strand, and unlabeled KatG1523 primer (SEQ ID NO:142) and 5'-biotmylated 

KatG904 primer (SEQ ID NO:141) was used in the PCR; biotinylation was achieved 
using the Oligonucleotide Biotin Labeling kit (Amersham). To produce the same 
fragments having the TMR label on the antisense strand, unlabeled KatG904 (SEQ ID 
NO:141) and TMR-labeled KatG1523 (SEQ ID NO:142) primers were used in the 
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PCR. Amplified PCR products were purified on a 6% denaturing gel, eluted 
overnight, precipitated with ethanol and redissolved in 50 ul H 2 0 as described in 
Example 21. 

iii) Cleavage Reaction Conditions 

5 The cleavage reaction conditions for analysis of katG substrates labelled on the 

sense strand were as follows. Five microliters of biotin labeled PCR product were 
combmed with 1 ul of 10X CFLP™ buffer (100 mM MOPS, pH 7.5, 0.5% each 
TWEEN 20 and NP-40) and 25 ng Cleavase™ BN enzyme. Prior to the initiation of 
the cleavage reaction, the DNA mixtures were denatured by incubation at 95°C for 10 
10 sec. The reactions were then cooled to 50°C and the reaction was started by the 

addition of 1 Ml of 2 mM MnCl 2 . The cleavage reactions were incubated for 2 min. at 
50°C and were stopped by adding 5 ul of a solution containing 95% formamide, 10 
mM EDTA and 0.02% each bromphenol blue and xylene cyanol. Four and one-half 
microliters of each sample were run on a 10% denaturing polyacrylamide gel and 
1 5 labeled fragments were visualized following transfer to a nylon memebrane as 

described in Example 21. The resulting autoradiogram is shown in Figure 86. 

In Figure 86, lanes marked "M" contain biotinylated molecular weight markers 
obtained from Amersham (Arlington Heights, IL) and include bands corresponding to 
lengths of 50, 100, 200, 300, 400, 500, 700, and 1000 nucleotides; the size of the 
20 markers is indicated by the use of large arrowheads. Lanes 1-4 contain the reaction 

products obtained by incubating the R463L, R463L;S315T, S315T and wild-type katG 
substrates in the presense of Cleavase™ BN enzyme, respectively. The mutation 
distance from the 5' end label is 485 nucleotides for the R463L mutation and 41 
nucleotides for the S315T mutation when the label is present on the sense strand. 
25 The results shown in Figure 86 demonstrate that similar, but distinctly different 

patterns of cleavage were generated from the wild-type and S315L mutant (seen in 
both the S315T and S315;R463L substrates) katG substrates labelled on the sense 
strand. Comparison of the CFLP™ pattern for wild-type fragment (lane 4) shows that 
the S315T mutation (seen in both mutants R463L;S315T and S315T; lanes 2 and 3) 
30 results in disappearance of Band B which is located around 40 nucleotides from the 
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end label in the wild-type substrate. The disappearance of Band B correlates very well 
with the distance of S315T mutation from the 5' end (41 nucleotides from the 5' end 
label on the sense strand). Subsequent experiments have demonstrated that the R463L 
mutant can be distinguished from wild-type by a mobility shift in a band migrating at 
5 approximately 500 nt from the 5' end label on the sense strand (the band shifts 

downward in the R463L mutant), but is difficult to resolve m many gels systems. 

The cleavage reaction conditions for analysis of katG substrates labeled on the 
anti-sense strand were as described for the sense strand. Four and one-half microliters 
of each sample were run on a 10% denaturing poly aery lamide gel and labeled 
10 fragments were visualized using the Hitachi FMBIO-100 fluoroimager as described in 

Example 35(a)(iii). The resulting scan is shown in Figure 87. 

In Figure 87, lanes marked "M" contain plasmid pUC19 DNA digested with 
Mspl and 3' end labeled with fluorescein ddUTP using terminal deoxynucleotidyl 
transferase as described in Example 10. This marker includes bands corresponding to 
15 lengths ofllO/111, 147, 190, 242, 331, 404, 489 and 501 bp. Additional marker bands 

of 26, 34, and 67 bp are not visible in this figure; the size of the markers is indicated 
by the' use of large arrowheads. Lanes 1-4 contain the reaction products obtained by 
incubating the R436L, S315T;R463L, S315T, and wild-type katG substrates in the 
presense of Cleavase™ BN enzyme, respectively. The location of the single base 
mutatbn from the 5' end label is 136 nucleotides for the R463L mutation and 580 
nucleotides for the S315T mutation when the label is present on the anti-sense strand. 

The results shown in Figure 87 demonstrate that wild-type can be distinguished 
from mutants containing the R463L substitution on the anti-sense strand. Comparison 
of the lanes containing the S315T;R463L double mutant or the R463L mutant by itself 
25 demonstrates that the R463L mutation is associated with the presence of a strong band 

migrating at approximately 130 nt (band A). This result, taken with that presented in 
Figure 86, demonstrates that all three of these mutants can be distinguished from one 
another, as well as from wild type, by CFLP™ analysis. 

The CFLP™ technology offers cost benefits by reducing gel electrophoresis 
30 processing time from 12-18 hours down to 5 to 10 minutes. Adapting the readout to 
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multi-lane Fluorescence Image Detectors allows for an expanded volume of work by 
allowing simultaneous processing of up to 48 reactions. The consequent decrease in 
turnaround time in performing the analyses reduces the turnaround time of reporting 
patient results from days to hours, or, as in the case of MDR-TB patients, from weeks 
to hours Early detection of MDR-TB can save thousands of dollars per patient by 
reducing the expense of extended stays in isolation wards, spent while testing various 
antibiotic treatments for efficacy. 

EXAMPLE 36 

Rapid Identification of Bacterial Strains by CFLP™ Analysis 

The results shown above demonstrated that CFLP™ analysis can be used to 
detect the presence of wild-type and drug-resistant mutations of M. tuberculosis by 
examining portions of gene associated with drug resistance (e.g., rpoB and katG). In 
order to examine whether the CFLP™ analysis could be used as a method of detecting 
and identifying a wide variety of microorganisms, CFLP™ analysis was conducted 
using substrates derived from bacterial 16S rRNA genes. 

Bacterial 16S rRNA genes vary throughout the phylogenetic tree; these genes 
do contain segments which are conserved at the species, genus or kingdom level. 
These features have been exploited to generate primers containing concensus sequences 
which flank regions of variability. These primers have been used to amplify segments 
of bacterial 16S rRNA genes which are then characterized by either Southern blot 
hybridization [Greisen et ai, J. Clin. Microbiol. 32:335 (1994)] or SSCP analysis 
[Widjojoatmondjo et al, J. Clin. Microbiol 32:3002 (1994)]. These types of analyse, 
while faster than traditional cultunng methods, are at best limited to the differentiation 
of species within a particular genus and higher bacterial taxons. However, it is often 
desirable to differentiate between different strains of the same species. For example, a 
given species may contain subspecies which comprise harmless as well as pathogenic 
organisms. In order to develop a technique which would allow the differentiation 
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between species and/or subspecieis, CFLP™ analysis was applied to segments derived 
from bacterial 16S rRNA genes. 

a) Bacterial Strains 

Table 3 below lists the bacterial strains used in this study. These strains were 
derived from the ATCC strains listed below with the exception of Desulfurococcus 
amylolyticus Strain Z-533 which was derived from a deposit obtained from the 
Deutsche Sammlung von Mikroorganismen (DSM). 



TABLE 3 
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15 



20 



25 



ORGANISM 


STRAIN NO. 


CHARAClkK.li>ll<-J> 


E. coli 


ATCC 11303 


Strain B 


E. coli 


ATCC 14948 


Derived from E. coli strain k-iz 


E. coli Serotype 0157: H7 


ATCC 43895 


Produces Shiga-like toxins I and II 


Campylobacter jejuni 
subsp. jejuni 


ATCC 33291 


Isolated from human stool 


Shigella dysenteriae 
Serotype 2 


ATCC 29027 


Isolated from human stool 


Salmonella choleraesuis 
subsp. choleraesuis 
Serotype typhi 


ATCC 6539 


Used for germicide testing 


Staphylococcus aureus 
subsp. aureus 


ATCC 33591 


Methicillin-resistant 


S. aureus subsp. aureus 


ATCC 33592 


Gentamicm- and methicillin-resistant 


S. aureus subsp. aureus 


ATCC 13565 


Produces enterotoxin A and large amounts 
of beta-hemolysin 


Staphylococcus hominis 


ATCC 29885 


Methicillin control for MIC testing 


Staphylococcus warneri 


ATCC 17917 


Used for soap germicide testing 


Desulfurococcus 
amylolyticus 


STRAIN 3822 


Thermophilic archaebacterium 



- 281 - 



10 



15 



The strains listed in Table 3 represent pathogenic microorganisms with the 
exception of E. coli strains B and K-12 and Desulfuro coccus amylolyticus. 
Desulfurococcus amylolyticus was included in this study to determine whether the 
concensus primers, whose design was based upon known rRNA gene sequences, could 
also be used to amplify rRNA gene fragments sequences from archeabacterial species 
whose rRNA gene sequences have not been reported. 

The strains listed in Table 3 were selected to provide representatives from 
several different genera {e.g., Escherichia, Shigella, Salmonella, Campylobacter, etc.) 
as well as to provide several representatives of different species (or subspecies) within 
a given genus. For example, three different strains of E. coli were chosen so that the 
consistency (or lack thereof) of the CFLP™ banding pattern generated by cleavage of 
an rRNA gene substrate could be exmained between species within a given genus. In 
addition, E. coli Serotype 0157:H7 was examined as this strain has been implicated in 
hemorragic colitis outbreaks. It was of interest to examine whether the CFLP™ 
pattern observed from clevage of a rRNA gene substrate from E. coli strains B or K- 
12 differed from that produced by cleavage of a rRNA gene substrate from E. coli 
Serotype 0157:H7. 

Table 4 below describes the phylogenic relationship between the strains used in 
this example. 
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TABLE 4 



Phylogenetic position of strains from Prokaryotic 
Small SubUnit rRNA Taxonomic List' 



1 


ARCHAEA 


1.2 


CRENARCHAEOTA 


1.2.1 


CRENARCHAEOTA-GROUP-I 

Desulfurococcus amylolyticus 


2 


BACTERIA 


2.13 


PURPLE-BACTERIA 


2.13.3 


GAMMA-SUBDIVISION 


2.13.3.15 


ENTERICS AND RELATIVES 


2.13.3.15.2 


E S CHERICHI A- SALMONELLA ASSEMBLAGE 
Escherichia coli Strain B 
Escherichia coli Strain K-12-derived 
Escherichia coli Serotype 0157: H7 
Shigella dysenteriae Serotype 2 
Salmonella choleraesuis subsp. choleraesuis 
Serotype typhi 


2.13.5 


EPSILON-SUBDIVISION 


2.13.5.2 


CAMPYLOBACTER AND RELATIVES 
Campylobacter jejuni subsp. jejuni 


2.15 


GRAM-POSITIVE PHYLUM 


2.15.5 


BACILLUS-LACTOBACILLUS-STREPTOCOCCUS 
SUBDIVISION 


2.15.5.10 


STAPHYLOCOCCUS GROUP 


2.15.5.10.2 


STAPHYLOCOCCUS SUBGROUP 

Staphylococcus aureus subsp. aureus ATCC 33591 
Staphylococcus aureus subsp. aureus ATCC 33592 
Staphylococcus aureus subsp. aureus ATCC 13565 
Staphylococcus hominis 
Staphylococcus warneri 



1 Data derived from the Ribosomal Database Project; available on the Inernet 
http://rdp.life.uiuc.edu/index.html; Maidak et at, Nucleic Acids Res., 22:3485 (1994). 
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b) Growth of Microorganisms 

In order to minimise handling of the pathogenic strains, the microorganisms 
were grown on slant cultures or on plates rather than in liquid culture. 

i) Growth of Escherichia, Shigella, Salmonella, and Staphylococcus 

5 species 

All strains were derived from the ATCC strains listed above in Table 3 as 
follows. A loopful of a culture previously frozen in Trypticase Soy Broth and 15% 
glycerol (Remel Corp., Lenexa, KS. Cat. 06-5024) was subcultured onto a trypticase 
soy agar slant (Remel, Cat. 06-4860). The cultures were incubated overnight at 37°C. 

10 ii) Growth of Campylobacter species 

A loopful of a culture previously frozen in Trypticase Soy Broth and 15% 
glycerol (Remel Corp., Lenexa, KS. Cat. 06-5024) was subcultured onto 
Campylobacter Agar supplemented with 10% sheep blood, amphotericin B, 
cephalothin, trimethoprim, vancomycin, and polymyxin B (BBL, Cat. 21727). 

15 Inoculated plates were sealed in Campy microaerophilic pouches (BBL, Cat. 4360656) 

and incubated at 42°C for 3 days. 

c) Extraction of Genomic DNA from Microorganisms 

For each bacterial sample, 300 ^1 of TE buffer [10 mM Tris-HCl 1 (pH 8.0 at 
25°C), 1 mM EDTA] and 300 ul phenol:cloroform:isoamyl alcohol (25:24:1) were 

20 placed in a 1.5 ml microfuge tube. This combination is referred to as the extraction 

buffer. A loopful (approximately 0.1 ml) of the desired bacterial strain was removed 
from a slant culture or plate and combined with the extraction buffer in a 1.5 ml 
microfuge tube and the contents were vortexed for two minutes. The extracted DNA 
present in the aqueous phase was processed for further purification as described below. 

25 Samples of E. coli and C. jejuni strains were ethanol precipitated and dissolved 

in 50 \x\ TE buffer. The samples were then treated with 0.5 ^g RNase A at 37°C for 
30 min. DNA was precipitated with ethanol, collected by centrifugation and dissolved 
in 200 p.1 10 mM Tris-HCl (pH 8.0 at 25°C). 
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Samples of Shigella, Salmonella, and Staphylococcus strains were concentrated 
using a MICROCON™ 30 filter (Amicon) to 50 ul and then transferred to TE buffer 
using MICROSPIN™ S-200 HR gel filtration columns (Pharmacia Biotech). The 
samples were then treated with 0.5 RNase A at 37°C for 80 mm. DNA was 
precipitated with ethanol, collected by centrifugation and dissolved in 200 p.1 10 mM 

Tris-HCl (pH 8.0 at 25°C). 

Genomic DNA of E. coli Strain B (ATCC 11303) was obtained from 
Pharmacia Biotech (Piscataway, NJ; Cat. 27-4566-01, Lot 411456601). The DNA was 
dissolved in 10 mM Tris-HCl (pH 8.0 at 25°C). 

Genomic DNA from Desulfurococcus amylolyticus Strain Z-533 (DSM 3822) 
was isolated and purified using the standard technique of cesium chloride 
centrifugation. [Bonch-Osmolovskaya, et al, Microbiology (Engl. Transl. of 
Mikrobiologiya) 57: 78 (1988)]. 

The concentration of the genomic DNA preparations was determined by 

measuring the OD 260 of the preparations. 

d) Design of Primer for the Amplification of 16s rRNA Genes of 
Bacterial Species 

Primers and probes have been reported which allow the amplification or 
detection of 16S rRNA sequences from a wide variety of bacterial strains. These 
oligonucleotide primers or probes represent consensus sequences derived from a 
comparision of the 16s rRNA gene sequences from a variety of eubacterial species. 
For example, oligonucleotide primers suitable for either PCR amplifcation or dot blot 
hybridization of bacterial rRNA gene sequences have been reported [e.g., PCT 
Publication WO 90/15157; Widjojoatmodjo et al, J. Clin. Microbiol 32:3002 (1994)]. 
Typically the conserved primer sequences are designed to flank nonconserved regions 
of the 16s rRNA gene with species-specific sequences. 
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A number of previously published conscensus primers derived from 16S rRNA 
gene sequences were examined for the ability to produce substrates for use in CFLP™ 
reactions. Primers 1638, 1659 and 1743 were described in PCT Publication WO 
90/15157. Primer ER10 was described in Widjojoatmodjo et al, supra. Primers SB- 
1, SB-3 and SB-4 represent new primers {i.e., not previously published). The primers 
used in this example are listed in .Table 5 below. 

TABLE 5 

Primers for PCR Amplification of 16S rRNA Genes 



PRIMER 


SEQ ID 
NO: 


SEQUENCE 


1638 


151 


5 '-AGAGTTTGATCCTGGCTCAG-3 ' 


ER10 


152 


5 ' -GGCGGACGGGTGAGT AA-3 ' 


1659 


153 


5'-CTGCTGCCTCCCGTAGGAGT-3' 


SB-4 


154 


5'-ATGACGTCAAGTCATCATGGCCCTTACGA-3' 


1743 


155 


5'-GTACAAGGCCCGGGAACGTATTCACCG-3' 


SB-1 


156 


5 ' -GCAACGAGCGCAACCC-3 ' 


SB-3 


157 


5 ' - ATGACGTC AAGTC ATC ATGGCCCTT A -3' 



The oligonucleotide primers were obtained from Integrated DNA Technologies, 
Inc. The oligonucleotides were dissolved in 10 mM Tris-HCl (pH 8 at 25°C) at a 
concentration of 20 uM. Two sets of primers were synthesized; one set having an OH 
group at the 5' end (i.e., unlabeled primers) and the other set having the fluorescent 
dye TET (tetrachlorinated analog of 6-carboxyfluorescein, Applied Biosystems) at the 
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5' end (i.e., TET-labelled primers). TET-labelled primers are indicated by theuse of 
"TET" as a suffix to the primer name (for example, TET-1638 indicates the 1638 

primer having a 5' TET label). 

The location of each of the primers listed in Table 5 is shown along the 
5 sequence of the E. coli rrsE gene (encodes a 16S rRNA) in Figure 88. In Figure 88 

the primer sequences are shown in bold type and underlining is used to indicate 
complete identity between primer sequences and E. coli rrsE gene sequences. The 
sequence of the E. coli rrsE gene is listed in SEQ ID NO: 158. As shown in Figure 
88, the 1638, ER10, SB-1, SB-3, SB-4 primers correspond to sequences present on the 
10 sense strand of the 16S rRNA gene. The 1659, 1743 primers correspond to sequences 

present on the anti-sense strand of the 16S rRNA gene. 

Figure 89 provides an alignment of the E. coli rrsE gene (SEQ ID NO:158), 
the Cam.jejun5 gene (a rRNA gene from C. jejuni) (SEQ ID NO:159) and the 
Stp.aureus gene (a rRNA gene from S. aureus) (SEQ ID NO:160). The location of the 
1638, ER10, 1659 (shown as the complement of 1659), SB-1, SB-3, SB-4 and 1743 
(shown as the complement of 1743) primers is indicated by the bold type. Gaps 
(dashes) are introduced to maximize alignment between the rRNA genes. 

In procaryotes the ribosomal RNA genes are present in 2 to 10 copies, with an 
average of 7 copies in Escherichia strains. Any PCR amplification produces a mixed 
population of these genes and is in essence a "multiplex" PCR from that strain. The 
CFLP represents a composite pattern from the slightly varied rRNA genes within that 
organism so no one particular rRNA sequence is directly responsible for the entire "bar 
code." In some cases these minor variations (bewteen rRNA genes; see, for example, 
minor variations between the E. coli rRNA gens in Figure 88) cause shifts in the 
25 minor (lower signal) bands in the CFLP pattern, allowing discrimination between very 

closely related organisms. More dramatic sequence variations, found in most or all 
copies of these genes, are seen when more distantly related organisms are compared 
(see, for example, the extensive variations between the E. coli, C. jejuni and S. aureus 
rRNA genes in Figure 89) and these larger differences are reflected in the CFLP 
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patterns as more dramatic pattern changes. Despite the variable nature of these genes, 
the amplification by PCR can be performed between conserved regions of the rRNA 
genes, so prior knowledge of the entire collection of rRNA sequences for any microbe 

of interest is not required. 

Three primers (TET-1638, TET-ER10, and TET-SB-4) were used for making 
the 5' end fluorescently labeled fragments of the sense strand of 16S rRNA genes; two 
other primers (TET-1659 and TET-1743) were used for making labeled fragments of 

the antisense strands. 

The predicted size of PCR products produced by amplification of 16s rRNA 
gene sequences from a variety of bacterial genera using the indicated primer pairs is 
shown in Table 6. In Table 6, the size of the predicted PCR product is based upon the 
known sequence of the 16S rRNA gene in the indicated species. The following 
abbreviations are used in Table 6: Deo (Desulfurococcus); E.co (E. coli), Cam 
(Campylobacter) and Stp (Staphylococus). The location of the PCR product relative to 
the sequence of the E. coli rrsE gene (see Figure 88) is given in the last column. 
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TABLE 6 

Combinations of Primers for PCR Amplification of 16S rRNA Sequences 




e) PCR Amplification of 16S rRNA Gene Sequences 

The ability of each primer pair listed in Table 6 to amplify 16S rRNA gene 
sequences from each bacterial strain listed in Table 3 was examined. It is well known 
that commerical preparations of recombinant Tag DNA polymerase contain various 
amount of E. coli 16S rRNA gene sequences. In order to minimize amplification of 
contaminating E. coli 16S rRNA sequences during the amplification of bacterial DNA 
samples, AMPLITAQ DNA polymerase, LD (Low DNA) (Perkin Elmer) was used in 
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the PCRs. This preparation of Tag DNA polymerase is tested by the manufacturer to 
verify that less than or equal to 10 copies of bacterial 16S ribosomal RNA gene 
sequences are present in a standard 2.5 unit aliquot of enzyme. 

Each primer pair (Table 6) was tested in PCRs. The PCR reactions contained 
10 mM Tris-HCl (pH 8.3 at 25°C), 50 mM KC1, 1.5 mM MgCl 2 , 0.001% w/v gelatin, 
60 uM each of dGTP, dATP, dTTP, and dCTP, 1 uM each of one 5 -TET labeled 
and one unlabeled primers, 2.5 units AMPLITAQ DNA polymerase, LD. The 
reactions were conducted in a final volume of 50 ul using AMPLITAQ DNA 
polymerase, LD, Lot E0332, or 100 ul volume using AMPLITAQ DNA polymerase, 
LD, Lot D0008. The amount of genomic DNA added varied from 6 to 900 ng per 
PCR. Control reactions which contained no input bacterial genomic DNA were also 
run to examine the amount of 16S rRNA product produced due to contaminats in the 
AMPLITAQ DNA polymerase, LD preparations. 

PCR reactions were performed on PTC- 100™ Programmable Thermal 
Controller (MJ Research, Inc.). Two sets of cycling conditions were utilized. The 
first set of conditions comprised 30 cycles of 95°C for 30 sec; 60°C for 1 min; 72°C 
for 30 sec; after the last cycle the tubes were cooled to 4°C. The second set of 
conditions comprised 30 cycles of of 95° for 30 sec; 60°C for 1 min; 72°C for 90 
sec after the last cycle the tubes were cooled to 4°C Thus, the difference between 
the'two cycling conditions is the length of time the reactions are held at the elongation 
temperature (72°C). These two elongation times were tested because the predicted size 
of the 16S rRNA targets varied from 208 to 1388 bp depending on the primer pair 

used in the amplification. 

As a rule of thumb, when the target to be amplified is less than 500 bp in 
length, a 30 sec elongation step is used; when the target is about 500-1000 bp in 
length, an elongation step of 30 to 60 sec is used; when the target is greater than 1 kb 
in length, the elongation is conducted for approximately 1 min per 1 kb length. While 
the first set of PCR conditions (30 sec elongation step) worked with the longer 
amplicons, the yield was lower than that obtained when the second set of PCR 
conditions (90 sec elongation) was used. 
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Following the thermal cycling, 400 ul of formamide containing 1 mM EDTA 
was added to each sample and the samples were concentrated to a volume of 40 ul in 
a MICROCON 30. The samples (40 ul) were loaded on a denaturing 6% 
polyacrylamide gel (7 M urea, 0.5X TBE running buffer), that was prewarmed to 50- 
5 55°C prior to the loading of the samples. The samples were run at 20 W for 20 min 

(200-350 bp fragments) or 40 mm (more than 1 kb fragments). The gels were scanned 
using a Fluorescent Method Bio Image Analyzer Model 100 (FMBIO-100, Hitachi) 

with a 585 or 505 ran filter. 

The results of these PCRs showed that each primer pair (Table 6) tested 

10 successfully amplified a fragment of the expected size. Thus the primer pairs shown 

in Table 6 are suitable for the amplification of end labeled DNA fragments using 
genomic DNA from variety of prokaryotes including archaea, gram-positive and gram- 
negative bacteria, different species of the same genus and different strains of the same 
species. These PCRs also demonstrated that, although the amount of genomic DNA 

1 5 present in the PCR varied from strain to strain, the yield of the amplified product was 

always many-fold higher than the trace yield of product from the E. coli genomic 
DNA present in AMPLITAQ DNA polymerase, LD, seen in the reactions which 
contained no input bacterial genomic DNA. 

f) Preparation of 16S rRNA Gene Substrates 

20 To generate labelled PCR products corresponding to bacterial 16S rRNA 

sequences for use in CFLP™ reactions, the following primer pairs were used in PCRs. 

1 The SB-l/TET-1743 pair was used to amplify an approximately 297 bp 

fragment from genomic DNA derived from Desulfurococcus amylolyticus (DSM 3822), 
E. coli Strain K-12 (ATCC 14948), S. aureus subsp. aureus (ATCC 33591) and S. 
25 aureus subsp. aureus (ATCC 33592). The resulting PCR product contains a 5' TET- 

label on the antisense strand. 
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The TET-SB-4/1743 pair was used to amplify an approximately 208 bp 
fragment from genomic DNA derived from E. coli Stain B (ATCC 11303), E. coli 
Strain K-12 (ATCC 14948), E. coli Serotype 0157: H7 (ATCC 43895), Shigella 
dysenteriae Serotype 2 (ATCC 29027), and Salmonella choleraesuis subsp. 
5 choleraesuis Serotype typhi (ATCC 6539). The resulting PCR product contains a 5' 

TET-label on the sense strand. 

3 The 1638/TET-1659 pair was used to amplify an approximately 350 bp 
fragment from genomic DNA derived from E. coli Stain B (ATCC 11303), E. coli 
Strain K-12 (ATCC 14948), E. coli Serotype 0157: H7 (ATCC 43895), Shigella 
10 dysenteriae Serotype 2 (ATCC 29027), and Salmonella choleraesuis subsp. 

choleraesuis Serotype typhi (ATCC 6539). The resulting PCR product contains a 5' 
TET-label on the antisense strand. 

4. The TET-ER10/1743 pair was used to amplify an approximately 1292 bp 
fragment from genomic DNA derived from E. coli Strain K-12 (ATCC 14948) and 
Campylobacter jejuni subsp. jejuni (ATCC 33291). The resulting PCR product 
contains a 5' TET-label on the sense strand. 

5. The 1638/TET-1659 pair was used to amplify an approximately 350 bp 
fragment from genomic DNA derived from E. coli Serotype 0157: H7 (ATCC 43895), 
Salmonella choleraesuis subsp. choleraesuis Serotype typhi (ATCC 6539), Shigella 
dysenteriae Serotype 2 (ATCC 29027), S. aureus subsp. aureus (ATCC 33591), S. 
aureus subsp. aureus (ATCC 33592), S. aureus subsp. aureus (ATCC 13565), S. 
hominis (ATCC 29885), and S. warneri (ATCC 17917). 

The PCRs were conducted as described in section (e) above. Two separate 
PCR reactions were performed using 0.2 ng of genomic DNA derived from 
Camylobacter jejuni subsp. jejuni (ATCC 33291) and the TET-ER10/1743 primer pair. 
One reaction was conducted in a final volume of 50 ul and used an extension step of 
30 sec at 72°C during thermal cycling. The second reaction was conducted in a final 
volume of 100 ul and used an extension step of 90 sec at 72°C. The yield of PCR 
product produced in the second reaction was 76% higher (as compared to first 
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reaction). Following the amplification reaction, the samples were processed for 
electrophoresis on denaturing polyacrylamide gels as described in sction (e) above. 
After electrophoresis, the desired bands were cut from the gel and eluted by placing 
the gel slice into 0.4 ml of a solution containing 0.5 M ammonium acetate, 0.1 mM 
5 EDTA and 0.1 % SDS. The mixture was then incubated at 55°C for 2 h and then at 

37°C for 12 h. The samples were concentrated to 25 ul using a MICROCON 30 
(Amicon) and transferred into water using S-200 MICROSPIN columns (Pharmacia). 

g) Cleavage Reaction Conditions 

Cleavage reactions were conducted in a final volume of 10 ul volume 
10 containing approximately 0.2 to 1 pmole (as indicated below) 5' TET-labeled DNA 

substrate, 10 ng Cleavase™ BN (Third Wave Technologies), IX CFLP™ buffer and 
0.2 mM MnCl 2 . The reactions were first assembled as a 9 ul mixture lacking MnCl 2 ; 
this mixture was heated to 95°C for 10 sec and then cooled down to the desired 
incubation temperature (45°C, 50°C or 65°C). Optimal reaction temperature fro each 
15 substrate was chosen based on even distribution of bands, and the presence of some 

undigested material to indicate representation of molecules all the way up to full 
length. Selected optimal temperatures for each substrate are indicated in the 
description of Figures 90-93 below. 

The cleavage reaction was started by the addition of 1 ul of 2 mM MnCl 2 . 
20 Following incubation at the desired temeprature for 2 min, the reaction was stopped by 

the addition of 10 ul of a solution containing 95% formamide, 5 mM EDTA, 5% 
glycerol and 0.02% methyl violet. Uncut or "no enzyme" controls were set up for 
each substrate as described above with the exception that H20 was used in place of 
the Cleavase™ BN enzyme. Samples (approximately 4 to 8 ul) were run on 6 to 
25 12% denaturing polyacrylamide gels (19:1 cross link) with 7 M urea in a buffer 

containing 45 mM Tns Borate, pH 8.3, 1.4 mM EDTA at 15 to 20 W for 9 minutes 
(specific gel percentages are indicated below in the descriptions of Figures 90-93). 
The gels were then scanned using a FMBIO-100 (Hitachi) with the 585 nm filter. 
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The resulting fluoroimager scans are shown in Figures 90-93. In Figure 90, the 
cleavage products generated by cleavage of an approximately 297 bp 16S rRNA 
substrate generated using the SB-l/TET-1743 pair and genomic DNA derived from 
Desulfurococcus amylolyticus (DSM 3822), E. coli Strain K-12 (ATCC 14948), S. 
5 aureus subsp. aureus (ATCC 33591) and S. aureus subsp. aureus (ATCC 33592) is 

shown. Lanes 1-4 contain the products generated by incubation of the substrate 
derived from Desulfurococcus amylolyticus (DSM 3822), E. coli Strain K-12 (ATCC 
14948), S. aureus subsp. aureus (ATCC 33591) and S. aureus subsp. aureus (ATCC 
33592) in the absense of Cleavase™ BN enzyme, respectively. Lanes 5-8 contain the 
10 products generated by incubation of the substrate derived from Desulfurococcus 

amylolyticus (DSM 3822), E. coU Strain K-12 (ATCC 14948), S. aureus subsp. aureus 
(ATCC 33591) and S. aureus subsp. aureus (ATCC 33592) in the presence of 
Cleavase™ BN enzyme, respectively. The CFLP™ reactions were performed using 
approximately 1 pmole of each PGR product and the cleavage reactions were incubated 
1 5 at 50°C for 2 min. The cleavage products were resolved by electrophoresis on an 8% 

polyacrylamide gel, as described above. 

The results shown in Figure 90 demonstrate that distinct CFLP™ patterns are 
obtained using the Desulfurococcus amylolyticus (DSM 3822), E. coli Strain K-12 
(ATCC 14948) and S. aureus subsp. aureus substrates. The same CFLP™ pattern was 
20 generated by cleavage of the two S. aureus subsp. aureus substrates (lanes 7 and 8); 

these two S. aureus subsp. aureus strains (ATCC 33591 and 33592) are considered 
different subspecies based upon differences in sesitivities to the antibiotics methicilhn 
and gentamicm. Resistant or sensitivity to these antibiotics is not associated with 
mutation in the 16S rRNA gene; therefore it was not expected that different CFLP™ 
25 patterns would be observed using a 16S rRNA substrate. 

The results shown in Figure 90 show that the SB-l/TET-1743 pair can be used 
to generate substrates for CFLP™ analysis which allow the identification and 
discrimination of Desulfurococcus amylolyticus (DSM 3822), E. coli Strain K-12 
(ATCC 14948) and S. aureus subsp. aureus. 
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In Figure 91, Panel A shows the reaction products generated by cleavage of an 
approximately 208 bp 16S rRNA substrate generated using the TET-SB-4/1743 pair 
and genomic DNA derived from E. coli Stain B (ATCC 11303), E. coli Strain K-12 
(ATCC 14948), E. coli Serotype 0157: H7 (ATCC 43895), Shigella dysenteriae 
Serotype 2 (ATCC 29027), and Salmonella choleraesuis subsp. choleraesuis Serotype 
typhi (ATCC 6539). TheTET-SB-4/1743 pair amplifies a portion of the 16S rRNA 
gene located in the 3' region of the gene (see Figure 88). 

The CFLP™ reactions shown in Figure 91, Panel A were performed using 
approximately 0.7 pmole of each PCR product and the cleavage reactions were 
incubated at 50°C for 2 min. The cleavage products were resolved by electrophoresis 
on an 8% denaturing polyacrylamide gel, as described for Figure 90. 

In Figure 91, Panel B shows the reaction products generated by cleavage of an 
approximately 350 bp 16S rRNA substrate generated using the 1638/TET-1659 pair 
and genomic DNA derived from E. coli Stain B (ATCC 11303), E. coli Strain K-12 
(ATCC 14948), E. coli Serotype 0157: H7 (ATCC 43895), Shigella dysenteriae 
Serotype 2 (ATCC 29027), and Salmonella choleraesuis subsp. choleraesuis Serotype 
typhi (ATCC 6539). The 1638/TET-1659 pair amplifies a portion of the 16S rRNA 
gene located in the 5' region of the gene (see Figure 88). 

The CFLP™ reactions shown in Figure 91, Panel B were performed using 
approximately 1 pmole of each PCR product and the cleavage reactions were incubated 
at 45°C. The cleavage products were resolved by electrophoresis on an 8% 
polyacrylamide gel. 

' The lanes marked "M" in Figure 91, Panels A and B contain plasmid pUC19 
DNA digested with Mspl and 3' end labeled with fluorescein ddUTP using terminal 
deoxynucleotidyl transferase as described in Example 10. Tins marker includes bands 
corresponding to lengths of 26, 34, 67, 110/111, 147, 190, 242 and 331 bp. 
Additional marker bands of 404, 489 and 501 bp are not visible in this figure. In 
Panel A, lanes 1-5 contain the uncut (i.e., no enzyme) controls and lanes 6-10 contain 
the cleavage products generated by the incubation of substrates derived from E. coli 
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Stain B (ATCC 11303), E. coli Strain K-12 (ATCC 14948), E. coli Serotype 0157: 
H7 (ATCC 43895), Shigella dysenteriae Serotype 2 (ATCC 29027), and Salmonella 
choleraesuis subsp. choleraesuis Serotype typhi (ATCC 6539), respectively. In Panel 
B lanes 1-5 contain the uncut (,e., no enzyme) controls and lanes 6-10 contain the 
5 cleavage products generated by the incubation of substrates derived from E. coli Stain 

K-12 (ATCC 14948), E. coli Strain B (ATCC 11303), E. coli Serotype 0157: H7 
(ATCC 43895), Shigella dysenteriae Serotype 2 (ATCC 29027), and Salmonella 
choleraesuis subsp. choleraesuis Serotype typhi (ATCC 6539), respectively. 

The lower molecular weight materials seen in the "uncut" lanes has been found 
10 to be due to degradtion of the g el- P urified material after storage for several days in 
dH20. This degradtion may be due to environmental nucleases that are active when 
EDTA is not present in the storage solution (i.e., the necessary metal ions may be 
present in trace amounts). This degradation is effectively supressed by mculsion of 
tRNA in the storage solution (see Example 21). The degradation seen in these uncut 
15 controls (Pane B, lanes 1-5) does not effect the CFLP results. 

The results shown in Figure 91 demonstrate that some regions of the 16S rRNA 
genes are more variable than others, and that analysis of these regions are particularly 
useful when comparing very closely related organisms. For example, substrates 
generated by the 1638/TET-1659 pair (which amplifies a portion of the 16S rRNA 
20 gene located in the 5' region of the gene) can be used to generate CFLP™ patterns 

which distinguish not only between the DNA derived from the genera of Escherichia, 
Shigella, and Salmonella (Panel B, lanes 6-10), but which also creates distinct 
cleavage patterns from the DNA derived from the three strains of E. coli tested (,e., 
strains B, K-12 and 0157: H7) (Panel B lanes 6-8). 
25 In contrast, no substantial difference in CFLP™ patterns was observed between 

the strains of the Escherichia-Salmonella assemblage for DNA fragments produced 
using the TET-SB-4/1743 pair which generates an approximately 208 bp fragment 
located near the 3 end of 16S rRNA genes (Panel A, lanes 6-10). This contrast in 
the degree of variation between the 5' and 3' regions of the 16S rRNA genes is 
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consistent with the results reported by Widjojoatmondjo et al, supra, in which the 
comparisons between strains of the Escherichia-Salmonella assemblage were made by 
SSCP analysis. 

Since each organism has mutiple copies of the 16S rRNA gene, and these co- 
amplify in each PCR, it was important to show that the products of different 
amplifications from the same organism produced the same cleavage pattern. In Figure 
92, the cleavage products generated by cleavage of an approximately 1292 bp 16S 
rRNA substrate generated using the TET-ER1 0/1743 pair in two separate PCR 
reactions from Campylobacter jejuni subsp. jejuni (ATCC 33291) are shown in lanes 2 
and 3. For comparison, the same region amplified from E. coli Strain K-12 (ATCC 
14948) is shown in lane 1. The CFLP™ reactions were performed using 
approximately 60 FMOLe of each PCR product and the cleavage reactions were 
incubated at 50°C for 2 min. Reactions were stopped by the addition of 95% 
foramide, 5 mM EDTA, 5% glycerol and 0.02% methyl violet. The cleavage products 
were resolved by electrophoresis on a 6% denaturing polyacrylamide gel as described 
above. 

The results shown in Figure 92 demonstrate that very different CFLP™ 
patterns were generated using substrates from Gamma {Escherichia, lane 1) and 
Epsilon {Campylobacter, lanes 2 and 3) subdivisions of Purple bacteria, but that the 
same CFLP™ pattern was observed between the products of separate PCR reactions on 
the same genomic DNA (lanes 2 and 3). 

In Figure 93, the cleavage products generated by cleavage of an approximately 
350 bp 16S rRNA substrate generated using the 1638/TET-1659 pair and genomic 
DNA derived from E. coli Serotype 0157: H7 (ATCC 43895), S. choleraesuis subsp. 
choleraesuis Serotype typhi (ATCC 6539), Shigella dysenteriae Serotype 2 (ATCC 
29027), S. aureus subsp. aureus (ATCC 33591), S. aureus subsp. aureus (ATCC 
33592) S. aureus subsp. aureus (ATCC 13565), S. hominis (ATCC 29885), and S. 
varneri (ATCC 17917) are shown in lanes 1-8, respectively. The CFLP™ reactions 
were performed as described above, using approximately 200 FMOL of each PCR 
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product; the cleavage reactions were incubated at 65°C for 2 mm. The cleavage 
products were resolved by electrophoresis on a 10% denaturing polyacrylamide gel as 
described above. 

The results shown in Figure 93 demonstrate that very different CFLP™ 
5 patterns were produced using DNA derived from strains representing Purple bacteria 

(lanes 1-3) and the Gram-positive phylum (lanes 4-8). A substantial difference 
between CFLP™ patterns was detected between the genera Escherichia (lane 1), 
Salmonella (lane 2), and Shigella (lane 3). 

Additionally, a substantial difference between the CFLP™ patterns was 
10 detected between species of Staphylococcus aureus (lanes 4-6), hominis (lane 7), and 

varneri (lane 8). No substantial difference between CFLP™ patterns was observed 
between the three strains of Staphylococcus aureus subsp. aureus ATCC 33591 (lane 
4) ATCC 33592 (lane 5), and ATCC 13565 (lane 6). These S. aureus isolates differ 
unreported antibiotic resistance, but are so closely related that the rRNA genes do not 
15 yet show divergence by CFLP™ analysis. 

The above results demonstrate that CFLP™ analysis can be used to 
discriminate between bacterial genera as well as between different species and 
subspecies (depending on the region of the 16S rRNA gene used as the substrate). A 
comparison of the CFLP™ patterns generated within the same or similar genera {e.g., 
20 Salmonella, Shigella and E. coli) shows an overall similarity in the banding pattern 

with differences revealed as changes in a small subset of the bands. When the 
comparision is made across different genra {e.g., between E. coli and S. aureus) a 
more striking change in barcode pattern is evident indicating that CFLP™ patterns 
may not only be used to detect differences between organisms, but the degree to which 
25 the patterns change may be used to assess the degree of evolutionary divergence 

between organisms. 

Substrates for CFLP™ analysis were produced by PCR amplification using 
different sets of primers. Some primer pairs (sets) are reported to be universal for all 
procaryotic organisms; other primer pairs have been observed to be specific for 
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representatives of lower taxons (See, PCT Publication WO 90/15157). Except for the 
primer sequences, no knowledge of the DNA sequence of the rRNA gene from any 
specific organism(s) is required for amplification and CFLP™ analysis of bactenal 16S 
rRNA genes. 

5 Distinct CFLP™ patterns were observed between representatives of archeaea 

and eubacteria, different phyla of eubacteria, different phyla within eubacteria, 
different subdivisions of the same phylum, different genera of the same assemblage, 
different species of the same genus and different strains of the same species. Distinct 
signatures in CFLP™ patterns were found that allowed discrimination of pathogenic 

10 folates, including those associated with food poisoning, from innocous members of the 

normal flora. 

While the PCR products generated using genomic DNA from different 
organisms with the same set of primers are indistinguishable by their mobility during 
gel electrophoresis (on non-gradient polyacrylamide gels), the Cleavase™ BN enzyme 
1 5 cleaves these PCR products into shorter fragments thereby generating a characteristic 

set of cleavage products (i.e., a distinct CFLP™ signature). The pattern of cleavage 
products generated is reproducible; DNA substrates generated in independent PCRs 
from the same organism using a given primer pair yield the same pattern of cleavage 
products. 

20 CFLP™ patterns can be generated using large DNA fragments (e.g., at least 

about 1.6 kb) and thus could cover the entire length of the bacterial 16S rRNA gene. 
CFLP™ can also be used in conjunction with shorter DNA fragments (about 200 bp) 
which are located at different positions throughout the 16S rRNA gene. 
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EXAMPLE 37 

CFLP tm Analysis of Substrates Containing Nucleotide Analogs 



The effect of using various nucleotide analogs to generate substrates for 
CFLP™ reactions was examined. As dicussed below, nucleotide analogs are used in 
PCRs for several reasons; therefore, the ability to analyze the modified products of 
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PCRs (i e , nucleotide analog-containing PCR products) by CFLP™ analysis was 
investigated. The 7-deaza purine analogs (7-deaza-dATP and 7-deaza-dGTP) serve to 
destabilize regions of secondary structure by weakening the intrastrand stacking of 
multiple adjacent purines. This effect can allow amplification of nucleic acids that, 
5 with the use of natural dNTPs, are resistant to amplification because of strong 

secondary structure [McConlogue et al, Nucleic Acids Res. 16:20 (1988)]. 

Similiarly, the analog dUTP is often used to replace dTTP, but for different 
reasons dUTP-containing DNA (this nomencature is shorthand for PCR products 
generated using dUTP; the actual PCR product will contain dUMP) can be destroyed 
10 by the enzymatic activity of uracil DNA glycosylase (UDG) while dTTP-containing 

DNA is untoched. When PCR products are produced containing dUMP in place of 
dTMP, UDG can be used in all subsequent reactions to eliminate false posirtive results 
due to' carry-over from the earlier PCRs, without preventing amplification from the 
normal DNA of interest. This method is widey used in clinical laboratories for 
1 5 performing PCR and thus this method would be used by most clinical laboratories 

using PCR in conjunction with CFLP™ for pathogen typing. Thus, the ability of the 
CFLP™ reaction to suitably cleave dUTP -containing DNA fragments (i.e., produce 
strong reproducible band patterns) was examined. 

For these companions, substrates corresponding to a 157 bp fragment derived 
20 from exon of of the wild-type and R422Q mutant of the human tryosmase gene were 

generated by PCR amplification using either 1) the standard mixture of dNTPs (i.e., 
dATP, dCTP, dGTP and dTTP); 2) dUTP in place of dTTP; 3) 7-deaza-dGTP 
(d 7 GTP) in place of dGTP; and 4) 7-deaza-dATP (d 7 ATP) in place of dATP. These 
substrates were then incubated with Cleavase™ BN enzyme and the effect the presence 
25 of the various nucleotide analogs on the cleavage pattern was examined. 

a) Preparation of Substrates Containing Nucleotide Analogs 

A 157 bp fragment of the human tyrosinase gene (exon 4) was amplified in 
PCRs using the following pair: 5' CACCGTCCTCTTCAAGAAG 3' (SEQ ID NO:42) 
and 5' biotin-CTGAATCTTGTAGATAGCTA 3' (SEQ ID NO:43). Plasmids 
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containing cDNA derived from the wild-type or R422Q mutant of the tyrosinase gene 
were used as template (see Example 10 for a description of these plasmids). The 
resulting double-stranded PGR products contain the 5' biotm label on the anti-sense 
strand such that sequence detected in the CFLP™ reaction is SEQ ID NO:48 (wild- 
type anti-sense strand) or SEQ ID NO:66 (R422Q mutant anti-sense strand). All PCRs 
were conducted in a final volume of 100 ,1 dATP, dCTP, dGTP, dTTP and dUTP 
were obtained from Perkm Elmer; d 7 ATP and d 7 GTP were obtained from Pharmacia. 
Tag DNA polymerase was obtained from Promega. The PCR mixtures were 
assembled as shown below in Table 7. 
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■ IX concentration contains 20 mM Tns-HCl, P H 8.5; 1.5 mM MgCl 2 ; 50 mM KC1; 
0.5% TWEEN 20; and 0.5% NP-40. 
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2 d 7 ATP completely substituted for dATP in the PCR. 

3 d 7 GTP completely substituted for dGTP in the PCR. 

* dUTP completely substituted for dTTP in the PCRjother nucleotides were present at 
a final concentration of 200 urn. In this reaction, the PCR buffer used was the 10X 
5 buffer (500 mM KC1, 100 mM Tns-Cl, P H 9.0, 1.0% Triton X-100) provided by 

Promega. 25 mM MgCl 2 was added separately to a final concentration of 2.5 mM. 

Wild-type and the mutant R422Q substrates were amplified using the natural 
and substituted nucleotide analogs listed above. For reactions containing the natural 
dNTPs, d 7 ATP and d 7 GTP, all reaction components were added together. Reactions 
10 containing dUTP were initially assembled without the polymerase (see below). 

The assembled reactions were placed in a thermocylcer (MJ Research, 
Watertown, MA) that was preheated to 95°C. The tubes were allowed to incubate for 
one minute at 95°C before amplification. The program was then set at 94°C for 30 
minutes, 50°C for one minute, 72°C degrees for two minutes for 34 cycles with a final 

15 72°C incubation for 5 minutes. 

Reactions containing dUTP were performed with a "hot start." All components 
except the polymerase were imxed, heated to 95°C for 1 minute, then cooled to 72°C. 
Tag polymerase (2.5 units) was then added in 10 ul of IX PCR buffer for a final 
volume of 100 ul. 

20 At the end of the amplification, the PCR products were made 0.3M NaOAc, 

with the exception of reactions containing dUTP; the dUTP-contaming reactions were 
brought to 2M NH 4 OAc; all were then precipitated by the addition of 2.5 volumes 
(total aqueous volumes) of absolute ethanol. The DNA pellets were collected by 
centrifugafion and then dried under vacuum. The pellets were resuspended in 10 ul of 

25 T10E0.1 buffer [10 mM Tris-HCl, P H 8.0; 0.1 mM EDTA] and 10 ul of STOP 

solution (95% formamide, 10 mM EDTA, 0.05% each bromophenol blue and xylene 
cyanol) (20 ul T10E0.1 and 16 ul of STOP for the dUTP-containing reactions). The 
tubes were then heated to 85°C for 2 minutes and the mixtures were resolved by 
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electrophoresis through 10% (6% for dUTP) denaturing acrylamide gel (19:1 cross 
Hnk) with 7M urea in a buffer of 45 mM Tris Borate, P H 8.3, 1.4 mM EDTA. 

The PCR products corresponding to the 157 bp substrate derived from the wild- 
type and R422Q mutant were gel purified as described in Example 21. The gel- 
purified DNAs were resuspended in T10E0.1 buffer using the following volumes: 40 
Hi for fragments containing only dNTPs; 40 ul for fragments containing d 7 ATP; 25 ul 
for fragments containing d 7 GTP and 25 ul for fragments containing dUTP. 

b) Cleavage Reaction Conditions 

The gel purified 157 bp tyrosinase substrates containing natural 
deoxynucleotides and nucleotide analogs were analyzed in cleavage reactions as 
follows Final reaction mixtures comprised 1 ul of the resuspended gel-purified DNA 
[see section (a) above] and 25 ng Cleavase™ BN in 10 mM MOPS, pH 7.5 with 0.2 
mM MnCl 2 , and 0.05% each TWEEN 20 and NP-40 in a volume of 20 ul. No 
enzyme controls were assembled in which distilled water replaced the Cleavase™ BN 
enzyme. The subsutrate DNAs were distributed into reaction tubes and brought to a 
volume of 15 ul with H 2 0. The remaining reaction components were mixed in a 
volume of 5 ul (i.*., at a 4X concentration). The DNAs were heated for 15 sec. at 
95°C to denature the DNA. The cleavage reactions were initiated by the addition of 5 
Hi of the enzyme/buffer mixture (the 4X concentrate). The cleavage reactions were 
incubated at 45°C for three minutes, and the reactions were terminated by the addition 
of 16 ul of Stop solution (described in sction a). Seven microliters of each sample 
was heated to 85°C for two minutes prior to loading onto a 10% denaturing 
acrylamide gel (19:1 cross linke), with 7M urea in a buffer of 45 mM Tris Borate pH 
8.3, 1.4 mM EDTA.. The gel was run at a constant 800 V until the bromophenol blue 

had migrated the length of the gel. 

Following electrophoresis, the biotinylated fragments were detected as described 
in Example 10 with the exception that 4 ul of the SAAP conjugate was added to 100 
ml of USB blocking buffer (1:25,000 dilution). After washing, 5 mis of CDP-Star™ 
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was used as the chemiluminescent substrate. The resulting autoradiogram is shown in 
Figure 94. 

In Figure 94, the lanes marked "M" contain biotinylated molecular weight 
markers obtained from Amersham (Arlington Heights, IL) and include bands 

5 corresponding to lengths of 50, 100 and 200 nucleotides (size indicated by use of 

numbers and large arrowheads). Lanes 1-8 contain reaction products obtained by 
incubation of the substrates in the absence of Cleavase™ BN enzyme (i.e., no enzyme 
or uncut controls). Lanes 9-16 contain reaction products obtained by incubation of the 
substrates in the presence of Cleavase™ BN enzyme. Lanes 1, 3, 5, 7, 9, 11, 13 and 

10 15 contain the wild-type substrate; lanes 2, 4, 6, 8, 10, 12, 14 and 16 contain the 

R422Q mutant substrate. The products shown in lanes 1, 2, 9 and 10 were generated 
from substrates generated using dNTPs in the PCRs. The products shown in lanes 3, 
4, 11 and 12 were generated from substrates generated using dUTP in place of dTTP 
in the PCRs. The products shown in lanes 5, 6, 13 and 14 were generated from 

15 substrates generated using d 7 GTP in place of dGTP in the PCRs. The products shown 

in lanes 7, 8, 15 and 16 were generated from substrates generated using d 7 ATP in 
place of dATP in the PCRs. It can be seen from this example that modified DNA 
fragments are suitable for cleavage in CFLP reactions. Though the banding pattern is 
substantially different with these substitions, the wild-type and R422Q mutant DNAs 

20 are readily distinguishable in all cases. 

While not limiting the invention to any particular theory, the changes in 
banding patterns observed when nucleotide analogs are utilized can be attributed to 
two sources. In all cases, but particularly in reference to the 7-deaza purines, the use 
of nucleotide analogs may substantially change the nature and stability of the 

25 intrastrand folded structures formed during the cleavage reaction. As a consequence, 

the locations of the cleavage sites would naturally shift. In addition, the substitution 
of the modified nucleotides may change the affinity of the cleavage enzyme for the 
folded cleavage structure, either strengthening or weakening cleavage at a particular 
site. 
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Examination of the variations seen between the wild-type and R422Q mutant 
when different analogs are used also shows that the use of these substituants can 
enhance the contrast between the varients. For example, with regard to the cleavage 
products of the two substrate DNAs (generated using dUTP or dTTP) in the region 
just above the 50 bp marker: one significant band that reduces in intensity between the 
wild-type and the mutant is more dramatically reduced in the dU-containing samples. 

The results shown in Figure 94 demonstrate that nucleotide analogs may be 
used for the generation of CFLP™ substrates. The substrates derived from the wild- 
type or R422Q mutant of the tyrosinase gene which contain nucleotide analogs 
produce distinct cleavage patterns which allow the discrimination and identification of 
the mutant and wild-type alleles. 

This example demonstrates that even with 100% substitution with either 7- 
deaza-GTP for dGTP or 7-deaza-ATP for dATP, robust CFLP patterns are generated, 
although the precise sites of clevage are different in the dNTP-containing and 7-deaza- 
dNTP containing substrates. The above results also demonstrated that single base 
changes present within DNA fragments containing nucleotide analogs still inflence the 
folded structure sufficiently to cause cleavage pattern changes similar to those seen 
when DNA fragments lacking nucleotide analogs are analyzed using the CFLP™ 
assay. 

From the above it is clear that the invention provides reagents and methods to 
permit the rapid screening of nucleic acid sequences for variations. These methods 
allow the identification of viral and bacterial pathogens as well as permit the detection 
of mutations associated with gene sequences {e.g., mutations associated with multiple 
drug resistance in M. tuberculosis or mutations associated with human disease). These 
methods provide improved means for the identification and characterization of 
pathogens. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: BROW, MARY ANN D. 

LYAMICHEV, VICTOR I. 
OLIVE , DAVID M. 

(ii) TITLE OF INVENTION: RAPID DETECTION AND IDENTIFICATION OF 
PATHOGENS 

(iii) NUMBER OF SEQUENCES: 160 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: MEDLEN & CARROLL 

(B) STREET: 220 MONTGOMERY STREET, SUITE 2200 

(C) CITY: SAN FRANCISCO 

(D) STATE: CALIFORNIA 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) ZIP: 94104 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM : PC -DOS /MS -DOS _ 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: CARROLL, PETER G. 

(B) REGISTRATION NUMBER: 32,837 

(C) REFERENCE/DOCKET NUMBER: FORS-01756 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 705-8410 

(B) TELEFAX: (415) 397-8338 

( 2 ) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATGAGGGGGA TGCTGCCCCT CTTTGAGCCC AAGGGCCGGG TCCTCCTGGT GGACGGCCAC 6 0 

CACCTGGCCT ACCGCACCTT CCACGCCCTG AAGGGCCTCA CCACCAGCCG GGGGGAGCCG 12 0 

GTGCAGGCGG TCTACGGCTT CGC C AAGAGC CTCCTCAAGG CCCTCAAGGA GGAC GGGGAC 

GCGGTGATCG TGGTCTTTGA CGCCAAGGCC CCCTCCTTCC GCCACGAGGC CTACGGGGGG 
TACAAGGCGG GCCGGGCCCC CACGCCGGAG GACTTTCCCC GGCAACTCGC CCTCATCAAG 
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GAGCTGGTGG ACCTCCTGGG GCTGGCGCGC CTCGAGGTCC CGGGCTACGA GGCGGACGAC 
GTCCTGGCCA GCCTGGCCAA GAAGGCGGAA AAGGAGGGCT ACGAGGTCCG CATCCTCACC 
GCCGACAAAG ACCTTTACCA GCTCCTTTCC GACCGCATCC ACGTCCTCCA CCCCGAGGGG 
TAC C T CATC A CCCCGGCCTG GCTTTGGGAA AAGTACGGCC TGAGGCCCGA CCAGTGGGCC 
GACTACCGGG CCCTGACCGG GGACGAGTCC GACAACCTTC CCGGGGTCAA GGGCATCGGG 
GAGAAGACGG CGAGGAAGCT TCTGGAGGAG TGGGGGAGCC TGGAAGCCCT C CTC AAGAAC 
CTGGACCGGC TGAAGCCCGC CATCCGGGAG AAGATCCTGG C CC AC ATGGA CGATCTGAAG 
CTCTCCTGGG ACCTGGCCAA GGTGCGCACC GACCTGCCCC TGGAGGTGGA CTTCGCCAAA 
AGGCGGGAGC CCGACCGGGA GAGGCTTAGG GCCTTTCTGG AGAGGCTTGA GTTTGGCAGC 
CTCCTCCACG AGTTCGGCCT TCTGGAAAGC CCCAAGGCCC TGGAGGAGGC CCCCTGGCCC 
CCGCCGGAAG GGGCCTTCGT GGGCTTTGTG CTTTCCCGCA AGGAGCCCAT GTGGGCCGAT 
CTTCTGGCCC TGGCCGCCGC CAGGGGGGGC CGGGTCCACC GGGCCCCCGA GC CTTATAAA 
GCCCTCAGGG ACCTGAAGGA GGCGCGGGGG CTTCTCGCCA AAGAC C TGAG CGTTCTGGCC 
CTGAGGGAAG GCCTTGGCCT CCCGCCCGGC GACGACCCCA TGCTCCTCGC CTACCTCCTG 
GACCCTTCCA ACACCACCCC CGAGGGGGTG GCCCGGCGCT ACGGCGGGGA GTGGACGGAG 
GAGGCGGGGG AGCGGGC CGC CCTTTCCGAG AGGCTCTTCG CCAACCTGTG GGGGAGGCTT 
GAGGGGGAGG AGAGGCTCCT TTGGCTTTAC CGGGAGGTGG AGAGGCCCCT TTCCGCTGTC 
CTGGCCCACA TGGAGGCCAC GGGGGTGCGC CTGGACGTGG CCTATCTCAG GGCCTTGTCC 
CTGGAGGTGG C CGAGGAGAT CGCCCGCCTC GAGGCCGAGG TCTTCCGCCT GGCCGGCCAC 
CCCTTCAACC TCAACTCCCG GGACCAGCTG GAAAGGGTCC TCTTTGACGA GCTAGGGCTT 
CCCGCCATCG GCAAGACGGA GAAGACCGGC AAGCGCTCCA CCAGCGCCGC CGTCCTGGAG 
GCCCTCCGCG AGGCCCACCC CATCGTGGAG AAGATCCTGG AGTACCGGGA GCTCACCAAG 
CTGAAGAGCA CCTACATTGA CCCCTTGCCG GACCTCATCC ACCCCAGGAC GGGCCGCCTC 
CACACCCGCT T C AAC C AG AC GGCCACGGCC ACGGGCAGGC TAAGTAGCTC CGATCCCAAC 
CTCCAGAACA TCCCCGTCCG CACCCCGCTT GGGCAGAGGA TCCGCCGGGC CTTCATCGCC 
GAGGAGGGGT GGCTATTGGT GGCCCTGGAC TATAGCCAGA TAGAGCTCAG GGTGCTGGCC 
CACCTCTCCG GCGACGAGAA CCTGATCCGG GTCTTCCAGG AGGGGCGGGA CATCCACACG 
GAGACCGCCA GCTGGATGTT CGGCGTCCCC CGGGAGGCCG TGGACCCCCT GATGCGCCGG 
GCGGCCAAGA CCATCAACTT CGGGGTCCTC TACGGCATGT CGGCCCACCG CCTCTCCCAG 
GAGCTAGCCA TCCCTTACGA GGAGGCCCAG GCCTTCATTG AGCGCTACTT TCAGAGCTTC 
CCCAAGGTGC GGGCCTGGAT TGAGAAGACC C TGGAGGAGG GCAGGAGGCG GGGGTACGTG 
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GAGACCCTCT TCGGCCGCCG CCGCTACGTG CCAGACCTAG 
CGGGAGGCGG CCGAGCGCAT GGCCTTCAAC ATGCCCGTCC 
ATGAAGCTGG CTATGGTGAA GCTCTTCCCC AGGCTGGAGG 
CTTCAGGTCC ACGACGAGCT GGTCCTCGAG GCCCCAAAAG 
CGGCTGGCCA AGGAGGTCAT GGAGGGGGTG TATCCCCTGG 
GT GGGG AT AG GGGAGGACTG GCTCTCCGCC AAGGAGTGAT 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
ATGGCGATGC TTCCCCTCTT TGAGCCCAAA GGCCGCGTGC 
CTGGCCTACC GCACCTTCTT TGCCCTCAAG GGCCTCACCA 
CAGGCGGTCT ACGGCTTCGC CAAAAGCCTC CTCAAGGCCC 
GTGGTGGTGG TCTTTGACGC CAAGGCCCCC TCCTTCCGCC 
AAGGCGGGCC GGGCCCCCAC CCCGGAGGAC TTTCCCCGGC 
TTGGTGGACC TCCTAGGCCT TGTGCGGCTG GAGGTTCCCG 
CTGGCCACCC TGGCCAAGCG GGCGGAAAAG GAGGGGTACG 
GACCGCGACC TCTACCAGCT CCTTTCGGAG CGCATCGCCA 
CTGATCACCC CGGCGTGGCT TTACGAGAAG TACGGCCTGC 
TACCGGGCCC TGGCGGGGGA CCCCTCGGAT AACATCCCCG 
AAGACCGCCC AGAGGCTCAT CCGCGAGTGG GGGAGCCTGG 
GACCAGGTGA AGCCCTCCTT GCGGGAGAAG CTCCAGGCGG 
TCCCGGAAGC TTTCCCAGGT GCACACTGAC CTGCCCCTGG 
CGCACACCCA ACCTGGAGGG TCTGCGGGCT TTTTTGGAGC 
CTCCACGAGT TCGGCCTCCT GGAGGGGCCG AAGGCGGCAG 
CCGGAAGGGG CTTTTTTGGG CTTTTCCTTT TCCCGTCCCG 
CTGGCCCTGG CTGGGGCGTG GGAGGGGCGC CTCCATCGGG 
CTGAGGGACC TTAAGGGGGT GCGGGGAATC CTGGCCAAGG 
CGGGAGGGCC TGGACCTCTT CCCAGAGGAC GACCCCATGC 



AGGCCCGGGT GAAGAGCGTG 
AGGGCACCGC CGCCGACCTC 
AAATGGGGGC CAGGATGCTC 
AGAGGGCGGA GGCCGTGGCC 
CCGTGCCCCT GGAGGTGGAG 
ACCACC 



TCCTGGTGGA 
CCAGCCGCGG 
TGAAGGAGGA 
ACGAGGCCTA 
AGCTGGCCCT 
GCTTTGAGGC 
AGGTGCGCAT 
TCCTCCACCC 
GCCCGGAGCA 
GGGT G AAGGG 
AAAACCTCTT 
GCATGGAGGC 
AGGTGGACTT 
GGTTGGAGTT 
AGGAGGCCCC 
AGCCCATGTG 
CACAAGACCC 
ACCTGGCGGT 
TCCTGGCCTA 



CGGCCACCAC 
CGAACCCGTT 
CGGGGACGTG 
CGAGGC CTAC 
CATCAAGGAG 
GGACGACGTG 
CCTCACTGCC 
TGAGGGGTAC 
GTGGGTGGAC 
CATCGGGGAG 
CCAGCACCTG 
CCTGGCCCTT 
CGGGAGGCGC 
TGGAAGCCTC 
CTGGCCCCCT 
GGCCGAGCTT 
CCTTAGGGGC 
TTTGGCCCTG 
CCTTCTGGAC 
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CCCTCCAACA CCACCCCTGA GGGGGTGGCC CGGCGTTACG GGGGGGAGTG GACGGAGGAT 
GCGGGGGAGA GGGCCCTCCT GGCCGAGCGC CTCTTCCAGA CCCTAAAGGA GCGCCTTAAG 
GGAGAAGAAC GCCTGCTTTG GCTTTACGAG GAGGTGGAGA AGCCGCTTTC CCGGGTGTTG 
GCCCGGATGG AGGC C ACGGG GGTCCGGCTG GACGTGGCCT ACCTCCAGGC CCTCTCCCTG 
GAGGTGGAGG CGGAGGTGCG CCAGCTGGAG GAGGAGGTCT TCCGCCTGGC CGGCCACCCC 
TTCAACCTCA ACTCCCGCGA CCAGCTGGAG CGGGTGCTCT TTGACGAGCT GGGCCTGCCT 
GCCATCGGCA AGACGGAGAA GACGGGGAAA CGCTCCACCA GCGCTGCCGT GCTGGAGGCC 
CTGCGAGAGG CCCACCCCAT CGTGGACCGC ATCCTGCAGT AC C GGG AG C T CACCAAGCTC 
AAGAACACCT ACATAGACCC CCTGCCCGCC CTGGTCCACC CCAAGACCGG CCGGCTCCAC 
ACCCGCTTCA ACCAGACGGC CACCGCCACG GGCAGGCTTT CCAGCTCCGA CCCCAACCTG 
C AGAAC AT C C CCGTGCGCAC CCCTCTGGGC CAGCGCATCC GCCGAGCCTT CGTGGCCGAG 
GAGGGCTGGG TGCTGGTGGT CTTGGACTAC AGCCAGATTG AGCTTCGGGT CCTGGCCCAC 
CTCTCCGGGG ACGAGAACCT GATCCGGGTC TTTCAGGAGG GGAGGGACAT CCACACCCAG 
ACCGCCAGCT GGATGTTCGG CGTTTCCCCC GAAGGGGTAG ACCCTCTGAT GCGCCGGGCG 
GCCAAGACCA TCAACTTCGG GGTGCTCTAC GGCATGTCCG CCCACCGCCT CTCCGGGGAG 
CTTTCCATCC CCTACGAGGA GGCGGTGGCC TTCATTGAGC GCTACTTCCA GAGCTACCCC 
AAGGTGCGGG CCTGGATTGA GGGGAC C CTC GAGGAGGGCC GCCGGCGGGG GTATGTGGAG 
ACCCTCTTCG GCCGCCGGCG CTATGTGCCC GACCTCAACG CCCGGGTGAA GAGCGTGCGC 
GAGGCGGCGG AGCGCATGGC CTTCAACATG CCGGTCCAGG GCACCGCCGC CGACCTCATG 
AAGCTGGC C A TGGTGCGGCT TTTCCCCCGG CTTCAGGAAC TGGGGGCGAG GATGCTTTTG 
CAGGTGCACG ACGAGCTGGT CCTCGAGGCC CCCAAGGACC GGGCGGAGAG GGTAGCCGCT 
TTGGCCAAGG AGGTCATGGA GGGGGTCTGG CCCCTGCAGG TGCCCCTGGA GGTGGAGGTG 
GGCCTGGGGG AGGACTGGCT CTCCGCCAAG GAGTAG 
(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
ATGGAGGCGA TGCTTCCGCT CTTTGAACCC AAAGGCCGGG TCCTCCTGGT GGACGGCCAC 
CACCTGGCCT ACCGCACCTT CTTCGCCCTG AAGGGCCTCA CCACGAGCCG GGGCGAACCG 
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GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTGAAGGA GGACGGGTAC 
AAGGCCGTCT TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGAG 
GCCTACAAGG CGGGGAGGGC CCCGACCCCC GAGGACTTCC CCCGGCAGCT CGCCCTCATC 
AAGGAGCTGG TGGACCTCCT GGGGTTTACC CGCCTCGAGG TCCCCGGCTA CGAGGCGGAC 
GACGTTCTCG CCACCCTGGC CAAGAAGGCG GAAAAGGAGG GGTACGAGGT GCGCATCCTC 
A CCGCCGACC GCGACCTCTA CCAACTCGTC TCCGACCGCG TCGCCGTCCT CCACCCCGAG 
GGCCACCTCA TCACCCCGGA GTGGCTTTGG G AG AAGT AC G GCCTCAGGCC GGAGCAGTGG 
GTGGACTTCC GCGCCCTCGT GGGGGACCCC TCCGACAACC TCCCCGGGGT CAAGGGCATC 
GGGGAGAAGA CCGCCCTCAA GCTCCTCAAG GAGTGGGGAA GCCTGGAAAA CCTCCTCAAG 
AACCTGGACC GGGTAAAGCC AGAAAACGTC C GGGAG AAGA TCAAGGCCCA CCTGGAAGAC 
CTCAGGCTCT CCTTGGAGCT CTCCCGGGTG CGCACCGACC TCCCCCTGGA GGTGGACCTC 
GCCCAGGGGC GGGAGCCCGA CCGGGAGGGG CTTAGGGCCT TCCTGGAGAG GCTGGAGTT C 
GGCAGCCTCC TCCACGAGTT CGGCCTCCTG GAGGCCCCCG CCCCCCTGGA GGAGGCCCCC 
TGGCCCCCGC CGGAAGGGGC CTTCGTGGGC TTCGTCCTCT CCCGCCCCGA GCCCATGTGG 
GCGGAGCTTA AAGCCCTGGC CGCCTGCAGG GACGGCCGGG TGCACCGGGC AGCAGACCCC 
TTGGCGGGGC TAAAGGACCT CAAGGAGGTC CGGGGCCTCC TCGCCAAGGA CCTCGCCGTC 
TTGGCCTCGA GGGAGGGGCT AGACCTCGTG CCCGGGGACG ACCCCATGCT CCTCGCCTAC 
CTCCTGGACC CCTCCAACAC CACCCCCGAG GGGGTGGCGC GGCGCTACGG GGGGGAGTGG 
ACGGAGGACG CCGCCCACCG GGCCCTCCTC TCGGAGAGGC TCCATCGGAA CCTCCTTAAG 
CGCCTCGAGG GGGAGGAGAA GCTCCTTTGG CTCTACCACG AGGT GGAAAA GCCCCTCTCC 
CGGGTCCTGG CC C ACATGGA GGCCACCGGG GTACGGCTGG ACGTGGCCTA CCTTCAGGCC 
CTTTCCCTGG AGCTTGCGGA GGAGATCCGC CGCCTCGAGG AGGAGGTCTT CCGCTTGGCG 
GGCCACCCCT TCAACCTCAA CTCCCGGGAC C AG C TGGAAA GGGTGCTCTT TGACGAGCTT 
AGGCTTCCCG CCTTGGGGAA GACGCAAAAG ACAGGCAAGC GCTCCACCAG CGCCGCGGTG 
CTGGAGGCCC TACGGGAGGC CCACCCCATC GTGGAGAAGA TCCTCCAGCA CCGGGAGCTC 
ACCAAGCTCA AGAACACCTA CGTGGACCCC CTCCCAAGCC TCGTCCACCC GAGGACGGGC 
CGCCTCCACA CCCGCTTCAA CCAGACGGCC ACGGCCACGG GGAGGCTTAG TAGCTCCGAC 
CCCAACCTGC AGAACATCCC CGTCCGCACC CCCTTGGGCC AGAGGATC CG CCGGGCCTTC 
GTGGCCGAGG CGGGTTGGGC GTTGGTGGCC CTGGACTATA GCCAGATAGA GCTCCGCGTC 
CTCGCCCACC TCTCCGGGGA CGAAAACCTG AT C AGGGT C T TCCAGGAGGG GAAGGACATC 
CACACCCAGA CCGCAAGCTG GATGTTCGGC GTCCCCCCGG AGGCCGTGGA CCCCCTGATG 
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CGCCGGGCGG 


CCAAGACGGT 


GAACTTCGGC 


/~tm f~* f^T* T 1 7\ (~* 

GTCL, 1 L- 1 AL-tj 


rcc&TGTCCGC 


CCATAGGCTC 


2040 


TCCCAGGAGC 


TTGCCATCCC 


CTACGAGGAG 




TTATAGAGGC 


TACTTCCAAA 


2100 


GCTTCCCCAA 


GGTGCGGGCC 


TGGATAGAAA 




GGAGGGGAGG 


AAGCGGGGCT 


2160 


ACGTGGAAAC 


CCTCTTCGGA 


AGAAGGCGCT 




CCTCAACGCC 


CGGGTGAAGA 


2220 


GCGTCAGGGA 


GGCCGCGGAG 




TCAACATGCC 


CGTCCAGGGC 


ACCGCCGCCG 


2280 


ACCTCATGAA 


/-i rn c* /i r* ri 7\ T 1 CI 
GC 1 LW-Lftiu 


GTGAAGCTCT 


TCCCCCGCCT 


CCGGGAGATG 


GGGGCCCGCA 


2340 


TGCTCCTCCA 


GGTCCACGAC 


GAGCTCCTCC 


Tf^fJAOClCCCC 

X OuflVJ'J'- v — 


CCAAGCGCGG 


GCCGAGGAGG 


2400 


TGGCGGCTTT 


GGCCAAGGAG 


GCCATGGAGA 


AGGCCTATCC 


CCTCGCCGTG 


CCCCTGGAGG 


24S0 


TGGAGGTGGG 


GATGGGGGAG 


GACTGGCTTT 


CCGCCAAGGG 


TTAG 




2504 


(2) INFORMATION FOR SEQ ID NO : 4 : 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi 



) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
1 5 10 

Ala Tyr Arg Thr Phe His Ala Leu Lys Gly 



Val Asp Gly His His Leu 



20 



Leu Thr Thr Ser Arg 
35 



Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
x 45 



40 



Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He Val 
50 55 



70 



Val Phe Asp Ala Lys 
65 



Tyr Lys Ala Gly Arg Ala Pro 
85 



Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly 



75 



Thr Pro Glu Asp Phe Pro Arg Gin Leu 



90 



95 



Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu 



100 



Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys 



115 



Ala Glu Lys 
130 



120 



Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys Asp 



135 



Leu Tyr Gin Leu Leu Ser Asp Arg He Hi 



Val Leu His Pro Glu Gly 



145 



150 



155 



160 



Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro 



165 
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Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn 



180 



Leu Pro 



Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu 



195 



200 



Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu 



210 



215 



Lys Pro Ala He Arg 



Glu Lys He Leu Ala His Met Asp Asp Leu Lys 



225 



230 



235 



t«,, m= Tires val Ara Thr Asp Leu Pro Leu Glu Val 
Leu Ser Trp Asp Leu Ala Lys val Arg ini. **t> 



245 



Asp Phe Ala Lys Arg Arg 



Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe 



260 



Leu Glu Arg Leu Glu Phe 



265 



270 



Gly Ser Leu Leu His Glu Phe Gly Leu Leu 



275 



280 



285 



Glu 



Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 



290 



295 



Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala Asp 



305 



310 



Leu Leu Ala Leu Ala Ala 



Ala Arg Gly Gly Arg Val His Arg Ala Pro 

3 3 5 



325 



330 



Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu 



340 



Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro 



355 



360 



Pro 



Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 



370 



375 



Thr Thr Pro Glu 
385 



Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 



390 



395 



Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu 



405 



Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu 



420 



Val Glu Arg Pro Leu Ser Ala 



Val Leu Ala His Met Glu Ala Thr Gly 



435 



440 



445 



Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala 



450 



455 



Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His 



465 



470 



Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 



485 



490 



Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys Arg 



500 



505 
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Ser Thr Ser Ala Ala Val Leu 



Glu Ala Leu Arg Glu Ala His Pro He 



515 



520 



525 



Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr 



530 



535 



Tyr He Asp Pro Leu Pro Asp Leu 



He His Pro Arg Thr Gly Arg Leu 



545 



His 



550 



555 



560 



Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 



565 



S er Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 



580 



Arg He Arg Arg 
595 



Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala 



600 



605 



Leu Asp Tyr Ser Gin He Glu Leu 



Arg Val Leu Ala His Leu Ser Gly 



610 



615 



620 



Asp 
625 



Glu Asn Leu He Arg Val Phe 



Gin Glu Gly Arg Asp He His Thr 



630 



635 



640 



Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro 



645 



Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 

66b 



660 



Met Ser Ala His Arg 
675 



Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 



680 



685 



Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 



690 



695 



Ala Trp He Glu Lys Thr Leu 



Glu Glu Gly Arg Arg Arg Gly Tyr Val 



705 



710 



715 



720 



Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg 



725 



Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 



740 



Val Gin Gly Thr Ala Ala Asp 



Leu Met Lys Leu Ala Met Val Lys Leu 



755 



760 



765 



Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val His 



770 



775 



Asp Glu Leu Val Leu Glu Ala Pro Lys 



Glu Arg Ala Glu Ala Val Ala 



785 



790 



795 



800 



Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 



805 



Leu 



Glu Val Glu Val Gly He Gly Glu Asp 



Glu Asp Trp Leu Ser Ala Lys Glu 



820 



825 



830 



(2) INFORMATION FOR SEQ ID NO : 5 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
1 5 10 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 

20 25 
Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 

35 40 
Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Val Val Val Val Val 

50 55 
Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala Tyr 
65 70 75 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 

Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Val Arg Leu Glu Val 

100 105 
Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Arg Ala 

115 120 
Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 

130 135 
Tyr Gin Leu Leu Ser Glu Arg He Ala lie Leu His Pro Glu Gly Tyr 
145 150 155 

Leu lie Thr Pro Ala Trp Leu Tyr Glu Lys Tyr Gly Leu Arg Pro Glu 
165 170 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Gin Arg Leu He Arg 
2_ 95 2 00 

Glu Trp Gly Ser Leu Glu Asn Leu Phe Gin His Leu Asp Gin Val Lys 

210 215 
Pro Ser Leu Arg Glu Lys Leu Gin Ala Gly Met Glu Ala Leu Ala Leu 
225 230 235 

Ser Arg Lys Leu Ser Gin Val His Thr Asp Leu Pro Leu Glu Val Asp 

245 250 
Phe Gly Arg Arg Arg Thr Pro Asn Leu Glu Gly Leu Arg Ala Phe Leu 

260 265 
Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu 
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275 280 285 



Gly Pro Lys Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala 

290 295 
Phe Leu Gly Phe Ser Phe Ser Arg Pro Glu Pro Met Trp Ala Glu Leu 
305 310 315 

Leu Ala Leu Ala Gly Ala Trp Glu Gly Arg Leu His Arg Ala Gin Asp 
325 330 

Pro Leu Arg Gly Leu Arg Asp Leu Lys Gly Val Arg Gly lie Leu Ala 

340 345 
Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Phe Pro 

355 360 
Glu Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr 
370 375 380 

Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp 
385 390 395 

Ala Gly Glu Arg Ala Leu Leu Ala Glu Arg Leu Phe Gin Thr Leu Lys 

405 410 
Glu Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu Val 

420 425 
Glu Lys Pro Leu Ser Arg Val Leu Ala Arg Met Glu Ala Thr Gly Val 

435 440 44b 

Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Val Glu Ala 



450 



Glu Val Arg Gin Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro 
465 470 475 

Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu 

485 4yo 
Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser 



500 



Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He Val 



515 



520 



Asp Arg lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr 



530 535 540 



lie Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Gly Arg Leu His 
545 550 

Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser 
565 570 

Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg 
580 585 

lie Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Val Leu Val Val Leu 
3 600 605 



595 
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Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp 

610 615 
Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr Gin 
625 6 30 " 5 

Thr Ala Ser Trp Met Phe Gly Val Ser Pro Glu Gly Val Asp Pro Leu 
645 650 

Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly Met 

660 66b 
Ser Ala His Arg Leu Ser Gly Glu Leu Ser lie Pro Tyr Glu Glu Ala 

675 680 
Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg Ala 

690 695 
Trp He Glu Gly Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu 
705 710 715 

Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val 
725 730 

Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val 

740 74b 
Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Arg Leu Phe 

755 760 
Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His Asp 

770 775 
Glu Leu Val Leu Glu Ala Pro Lys Asp Arg Ala Glu Arg Val Ala Ala 
785 790 795 

Leu Ala Lys Glu Val Met Glu Gly Val Trp Pro Leu Gin Val Pro Leu 



805 



Glu Val Glu Val Gly Leu Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 825 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 34 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
1 5 10 



35 40 



Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 

20 25 
Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 

_ _ /in *± 
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Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala Val Phe 
Y 50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 



85 



Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 IO 5 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg 
130 135 140 

Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu 
14 5 150 I 55 

Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Arg 



165 



Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro Ser Asp 



180 



Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys Leu 

195 200 205 

Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg 

215 220 



210 



Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu Glu Asp 



225 



230 



Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu 
245 250 

Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly Leu Arg 

265 z ' u 



260 



Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 

275 280 285 

Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro Pro Pro 



290 



295 300 



Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp 



305 



310 



Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val His Arg 
330 J -S3 



325 



Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 ^=>v 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 

** — *- 3 8 0 



370 375 
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Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Olu Trp 



385 



Thr 



390 



Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 



405 



410 



Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 



420 



425 



His Glu Val Glu Lys Pro Leu 
435 



Ser Arg Val Leu Ala His Met Glu Ala 



440 



445 



Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 



450 



455 



Leu Ala Glu Glu lie Arg Arg Leu 



Glu Glu Glu Val Phe Arg Leu Ala 



465 



470 



475 



480 



Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 



485 



Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly 



500 



Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 



515 



Pro He Val Glu Lys He Leu 



Gin His Arg Glu Leu Thr Lys Leu Lys 



530 



535 



540 



Asn Thr Tyr Val Asp Pro Leu Pro Ser 



Leu Val His Pro Arg Thr Gly 



545 



550 



555 



560 



Arg Leu His Thr Arg Phe Asn 



Gin Thr Ala Thr Ala Thr Gly Arg Leu 



565 



570 



575 



Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 



580 



585 



Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 



595 



600 



Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 



610 



Ser Gly Asp 



615 



Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He 



625 



His 



630 



635 



Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 



645 



650 



Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 



660 



Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr 



680 



685 



Tyr Gly Met Ser 
675 

Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 



690 



695 



Val Arg Ala Trp He Glu Lys 



Thr Leu Glu Glu Gly Arg Lys Arg Gly 



705 



710 



715 



720 
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Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 7 60 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 

770 77 5 780 

Val His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 7 90 795 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 

Lys Gly 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
ATGNNGGCGA TGCTTCCCCT CTTTGAGCCC AAAGGCCGGG TCCTCCTGGT GGACGGCCAC 
CACCTGGCCT ACCGCACCTT CTTCGCCCTG AAGGGCCTCA CCACCAGCCG GGGCGAACCG 
GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTGAAGGA GGACGGGGAC 
NNGGCGGTGN TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGAG 
GCCTACAAGG CGGGCCGGGC CCCCACCCCG GAGGACTTTC CCCGGCAGCT CGCCCTCATC 
AAGGAGCTGG TGGACCTCCT GGGGCTTGCG CGCCTCGAGG TCCCCGGCTA CGAGGCGGAC 
GACGTNCTGG CCACCCTGGC CAAGAAGGCG GAAAAGGAGG GGTACGAGGT GCGCATCCTC 
ACCGCCGACC GCGACCTCTA CCAGCTCCTT TCCGACCGCA TCGCCGTCCT CCACCCCGAG 
GGGTACCTCA TCACCCCGGC GTGGCTTTGG G AGAAGT AC G GCCTGAGGCC GGAGCAGTGG 
GTGGACTACC GGGCCCTGGC GGGGGACCCC TCCGACAACC TCCCCGGGGT CAAGGGCATC 
GGGGAGAAGA CCGCCCNGAA GCTCCTCNAG GAGTGGGGGA GCCTGGAAAA CCTCCTCAAG 
AACCTGGACC GGGTGAAGCC CGCCNTCCGG GAGAAGATCC AGGCCCACAT GGANGACCTG 
ANGCTCTCCT GGGAGCTNTC CCAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 
AAGNGGCGGG AGCCCGACCG GGAGGGGCTT AGGGCCTTTC TGGAGAGGCT GGAGTTTGGC 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



^0 



AGCCTCCTCC ACGAGTTCGG CCTCCTGGAG GGCCCCAAGG 
CCCCCGCCGG AAGGGGCCTT CGTGGGCTTT GTCCTTTCCC 
GAGCTTCTGG CCCTGGCCGC CGCCAGGGAG GGCCGGGTCC 
ANGGGCCTNA GGGACCTNAA GGAGGTGC GG GGNCTCCTCG 
GCCCTGAGGG AGGGCCTNGA CCTCNTGCCC GGGGACGACC 
CTGGACCCCT CCAACACCAC CCCCGAGGGG GTGGCCCGGC 
GAGGANGCGG GGGAGCGGGC CCTCCTNTCC GAGAGGCTCT 
CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCAGGAGG 
GTCCTGGCCC ACATGGAGGC CACGGGGGTN CGGCTGGACG 
TCCCTGGAGG TGGCGGAGGA GATCCGCCGC CTCGAGGAGG 
CACCCCTTCA ACCTCAACTC CCGGGACCAG CTGGAAAGGG 
CTTCCCGCCA TCGGCAAGAC GGAGAAGACN GGCAAGCGCT 
GAGGCCCTNC GNGAGGC CCA CCCCATCGTG GAGAAGATCC 
AAGCTCAAGA AC AC CTAC AT NGACCCCCTG CCNGNCCTCG 
CTCCACACCC GCTTCAACCA GACGGCCACG GCCACGGGCA 
AACCTGCAGA ACATCCCCGT CCGCACCCCN CTGGGCCAGA 
GCCGAGGAGG GNTGGGTGTT GGTGGCCCTG GACTATAGCC 
GCCCACCTCT CCGGGGACGA GAACCTGATC CGGGTCTTCC 
ACCCAGACCG CCAGCTGGAT GTTCGGCGTC CCCCCGGAGG 
CGGGCGGCCA AGACCATCAA CTTCGGGGTC CTCTACGGCA 
CAGGAGCTTG CCATCCCCTA CGAGGAGGCG GTGGCCTTCA 
TTCCCCAAGG TGCGGGCCTG GATTGAGAAG ACCCTGGAGG 
GTGGAGACCC TCTTCGGCCG CCGGCGCTAC GTGCCCGACC 
GTGCGGGAGG CGGCGGAGCG CATGGCCTTC AACATGCCCG 
CTCATGAAGC TGGCCATGGT GAAGCTCTTC CCCCGGCTNC 
CTCCTNCAGG TCCACGACGA GCTGGTCCTC GAGGCCCCCA 
GCCGCTTTGG CCAAGGAGGT CATGGAGGGG GTCTATCCCC 
GAGGTGGGGA TGGGGGAGGA CTGGCTCTCC GCCAAGGAGT 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 3 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 



CCCTGGAGGA 
GCCCCGAGCC 
ACCGGGCACC 
CCAAGGACCT 
CCATGCTCCT 
GCTACGGGGG 
TCCNGAACCT 
TGGAGAAGCC 
TGGCCTACCT 
AGGTCTTCCG 
TGCTCTTTGA 
CCACCAGCGC 
TGCAGTACCG 
TCCACCCCAG 
GGCTTAGTAG 
GGATCCGCCG 
AGATAGAGCT 
AGGAGGGGAG 
CCGTGGACCC 
TGTCCGCCCA 
TTGAGCGCTA 
AGGGCAGGAG 
TCAACGCCCG 
TCCAGGGCAC 
AGGAAATGGG 
AAGAGCGGGC 
TGGCCGTGCC 
AG 



GGCCCCCTGG 
CATGTGGGCC 
AGACCCCTTT 
GGCCGTTTTG 
CGCCTACCTC 
GGAGTGGACG 
NNNGCAGCGC 
CCTTTCCCGG 
CCAGGCCCTN 
CCTGGCCGGC 
CGAGCTNGGG 
CGCCGTGCTG 
GGAGCTCACC 
GACGGGCCGC 
CTCCGACCCC 
GGCCTTCGTG 
CCGGGTCCTG 
GGACATCCAC 
CCTGATGCGC 
CCGCCTCTCC 
CTTCCAGAGC 
GCGGGGGTAC 
GGTGAAGAGC 
CGCCGCCGAC 
GGCCAGGATG 
GGAGGNGGTG 
CCTGGAGGTG 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2502 
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(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



Met Xaa 



Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 



10 



Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 
20 25 



Leu Thr Thr Ser Arg Gly Glu Pro 



Val Gin Ala Val Tyr Gly Phe Ala 



35 



40 



45 



Lys 



Ser Leu Leu Lys Ala Leu Lys 



Glu Asp Gly Asp Ala Val Xaa Val 



50 



55 



60 



Val Phe Asp Ala Lys Ala Pro Ser 



Phe Arg His Glu Ala Tyr Glu Ala 



65 



70 



75 



80 



Tyr Lys Ala Gly Arg Ala Pro 



Thr Pro Glu Asp Phe Pro Arg Gin Leu 



85 



90 



95 



Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Xaa Arg Leu Glu 



100 



105 



Val Pro Gly Tyr Glu Ala Asp Asp 



Val Leu Ala Thr Leu Ala Lys Lys 



115 



120 



125 



Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp 



130 



135 



Leu Tyr Gin Leu Leu Ser Asp Arg I 



le Ala Val Leu His Pro Glu Gly 



145 



Tyr 



150 



155 



160 



Leu He Thr Pro 
165 



Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro 



170 



Glu Gin Trp 



Val Asp Tyr Arg Ala Leu Xaa Gly Asp Pro Ser Asp Asn 



180 



185 



Leu 



Pro Gly Val Lys Gly He Gly Glu Lys 



195 



200 



Thr Ala Xaa Lys Leu Leu 
205 



Xaa Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg Val 



210 



215 



Lys 
225 



Pro Xaa Xaa Arg Glu Lys I 



le Xaa Ala His Met Glu Asp Leu Xaa 



230 



235 



240 



Leu Ser Xaa Xaa Leu Ser 



Xaa Val Arg Thr Asp Leu Pro Leu Glu Val 



245 



250 



255 



Asp Phe Ala Xaa Arg Arg 



Glu Pro Asp Arg Glu Gly Leu Arg Ala Phe 



260 



265 



270 



Leu Glu Arg Leu Glu Phe Gly Ser Leu 



Leu His Glu Phe Gly Leu Leu 



275 



280 



285 



Glu Xaa Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 



290 



295 



321 
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Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 



305 

Leu Leu Ala Leu Ala 
325 



310 



Ala Ala Arg Xaa Gly Arg Val His Arg Ala Xaa 



330 



Asp Pro Leu Xaa 



Gly Leu Arg Asp Leu Lys Glu Val Arg Gly Leu Leu 



340 



345 



Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Xaa 



355 



Pro Gly Asp Asp 
370 



360 



Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 



375 



380 



Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 



385 



390 



Asp Ala Gly Glu Arg Ala Leu 



Leu Ser Glu Arg Leu Phe Xaa Asn Leu 



405 



410 



415 



Xaa Xaa Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Xaa Glu 



420 



425 



Val Glu Lys 
435 



Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 

4 4b 



440 



Val Arg Leu Asp Val Ala Tyr Leu 



Arg 
450 



Gin Ala Leu Ser Leu Glu Val Ala 



455 



460 



Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 



465 



470 



Pro Phe Asn Leu Asn Ser Arg Asp 



Gin Leu Glu Arg Val Leu Phe Asp 



485 



Glu Leu Gly Leu Pro 



490 



495 



Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg 



500 



505 



510 



Ser 



Thr Ser Ala Ala Val Leu Glu Ala Leu Arg 



515 



520 



Glu Ala His Pro He 
525 



Val Glu Lys He Leu Gin Tyr Arg 



Glu Leu Thr Lys Leu Lys Asn Thr 



530 



535 



540 



Tyr He Asp Pro Leu Pro Xaa Leu 



Val His Pro Arg Thr Gly Arg Leu 



545 



550 



555 



560 



His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 

D / U 



565 



Ser Asp 



Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 



580 



585 



Arg He Arg Arg Ala Phe 



Val Ala Glu Glu Gly Trp Xaa Leu Val Ala 



595 



600 



605 



Leu Asp Tyr Ser Gin He Glu Leu Arg 



Leu Arg Val Leu Ala His Leu Ser Gly 



610 



615 



620 



Asp Glu Asn Leu He Arg 



Val Phe Gin Glu Gly Arg Asp He His Thr 



625 



630 



635 



640 



322 
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,„ <- ni,^ riv vai Pro Pro Glu Ala Val Asp Pro 
Gin Thr Ala Ser Trp Met Phe Gly Val Pro pro ^ 

645 650 

L eu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 

660 66b 
Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 

675 680 
Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 

690 695 
Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val 
705 710 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 

725 730 
Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 

740 745 
Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 

755 760 
Phe Pro Arg Leu Xaa Glu Met Gly Ala Arg Met Leu Leu Gin Val His 

770 775 
Asp Glu Leu Val Leu Glu Ala Pro Lys Xaa Arg Ala Glu Xaa Val Ala 
785 790 

Ala Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 

805 810 
Leu Glu Val Glu Val Gly Xaa Gly Glu Asp Trp Leu Ser Ala Lys Glu 

- - - o 2, o 



820 



Xaa 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



ATGAATTCGG 


GGATGCTGCC 


CCTCTTTGAG 


CCCAAGGGCC 


GGGTCCTCCT 


GGTGGACGGC 


60 


CACCACCTGG 


CCTACCGCAC 


CTTCCACGCC 


CTGAAGGGCC 


TCACCACCAG 


CCGGGGGGAG 


120 


CCGGTGCAGG 


CGGTCTACGG 


CTTCGCCAAG 


AGCCTCCTCA 


AGGCCCTCAA 


GGAGGACGGG 


180 


GACGCGGTGA 


TCGTGGTCTT 


TGACGCCAAG 


GCCCCCTCCT 


TCCGCCACGA 


GGCCTACGGG 


240 


GGGTACAAGG 


CGGGCCGGGC 


CCCCACGCCG 


GAGGACTTTC 


CCCGGCAACT 


CGCCCTCATC 


300 


AAGGAGCTGG 


TGGACCTCCT 


GGGGC TGGCG 


CGCCTCGAGG 


TCCCGGGCTA 


CGAGGCGGAC 


360 
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GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 
ACCGCCGACA AAGACCTTTA CCAGCTCCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 
GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 
GCCGACTACC GGGC C C TG AC CGGGGACGAG TCCGACAACC TTCCCGGGGT CAAGGGCATC 
GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 
^CCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC TGGCCCACAT GGACGATCTG 
AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC C C CTGGAGGT GGACTTCGCC 
AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 
AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 
CCCCCGCCGG AAGGGGCCTT CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC CATGTGGGCC 
GATCTTCTGG CCCTGGCCGC CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC CGAGCCTTAT 
^GCCCTCA GGGAC CTGAA GGAGGCGCGG GGGCTTCTCG CCAAAGACCT GAGCGTTCTG 
GCCCTGAGGG AAGGCCTTGG CCTCCCGCCC GGCGACGACC CCATGCTCCT CGCCTACCTC 
CTGGACCCTT CCAACACCAC CCCCGAGGGG GTGGCCCGGC GCTACGGCGG GGAGTGGACG 
GAGGAGGCGG GGGAGCGGGC CGCCCTTTCC GAGAGGCTCT TCGCCAACCT GTGGGGGAGG 
CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCGGGAGG TGGAGAGGC C CCTTTCCGCT 
GTCCTGGCCC ACATGGAGGC CACGGGGGTG CGCCTGGACG TGGCCTATCT CAGGGCCTTG 
TCCCTGGAGG TGGCCGGGGA GATCGCCCGC CTCGAGGCCG AGGTCTTCCG CCTGGCCGGC 
CACCCCTTCA ACCTCAACTC C CGGGAC C AG CTGGAAAGGG TCCTCTTTGA CGAGCTAGGG 
CTTCCCGCCA TCGGCAAGAC GGAGAAGACC GGCAAGCGCT CCACCAGCGC CGCCGTCCTG 
GAGGCCCTCC GCGAGGCCCA CCCCATCGTG GAGAAGATCC TGCAGGCATG CAAGCTTGGC 
ACTGGCCGTC GTTTTACAAC GTCGTGA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 
CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG CCGGGGGGAG 
CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 

- 324 - 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1647 



60 
120 
180 




GACGCGGTGA TCGTGGTCTT 
GGGTACAAGG CGGGCCGGGC 
AAGGAGCTGG TGGACCTCCT GGGGCTGGCG 



TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGGG 
CCCCACGCCG GAGGACTTTC CCCGGCAACT CGCCCTCATC 
CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 



GACGTCCTGG CCAGCCTGGC 
ACCGCCGACA AAGACCTTTA 
GGGTACCTCA TCACCCCGGC 
GCCGACTACC GGGCCCTGAC CGGGGACGAG 



CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 
CCAGCTCCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 
CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 



TCCGACAACC TTCCCGGGGT CAAGGGCATC 



GGGGAGAAGA CGGCGAGGAA 
AACCTGGACC GGCTGAAGCC CGCCATCCGG 



GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 



AAGCTCTCCT GGGACCTGGC 
AAAAGGCGGG AGCCCGACCG 
AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA 



GAGAAGATCC TGGCCCACAT GGACGATCTG 
CAAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 
GGAGAGGCTT AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 



AGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 



CCCCCGCCGG AAGGGGCCTT 
GATCTTCTGG CCCTGGCCGC 
AAAGCCCTCA GGGACCTGAA 
GCCCTGAGGG AAGGCCTTGG 



CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC CATGTGGGCC 
CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC CGAGCCTTAT 
GGAGGCGCGG GGGCTTCTCG CCAAAGACCT GAGCGTTCTG 
CCTCCCGCCC GGCGACGACC CCATGCTCCT CGCCTACCTC 



CTGGACCCTT CCAACACCAC CCCCGAGGGG 
GAGGAGGCGG GGGAGCGGGC CGCCCTTTCC 
CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT 
GTCCTGGCCC ACATGGAGGC CACGGGGGTG 
TCCCTGGAGG TGGCCGGGGA GATCGCCCGC 



GTGGCCCGGC GCTACGGCGG GGAGTGGACG 
GAGAGGCTCT TCGCCAACCT GTGGGGGAGG 
TACCGGGAGG TGGAGAGGCC CCTTTCCGCT 
CGCCTGGACG TGGCCTATCT CAGGGCCTTG 
CTCGAGGCCG AGGTCTTCCG CCTGGCCGGC 



CACCCCTTCA ACCTCAACTC 
CTTCCCGCCA TCGGCAAGAC 
GAGGCCCTCC GCGAGGCCCA CCCCATCGTG 
AAGCTGAAGA GCACCTACAT TGACCCCTTG 
CTCCACACCC GCTTCAACCA GACGGCCACG 



CCGGGACCAG CTGGAAAGGG TCCTCTTTGA CGAGCTAGGG 
GGAGAAGACC GGCAAGCGCT CCACCAGCGC CGCCGTCCTG 



GAGAAGATCC TGCAGTACCG GGAGCTCACC 
CCGGACCTCA TCCACCCCAG GACGGGCCGC 
GCCACGGGCA GGCTAAGTAG CTCCGATCCC 



AACCTCCAGA ACATCCCCGT 
GCCGAGGAGG GGTGGCTATT 
GCCCACCTCT CCGGCGACGA GAACCTGATC 



CCGCACCCCG CTTGGGCAGA GGATCCGCCG GGCCTTCATC 
GGTGGCCCTG GACTATAGCC AGATAGAGCT CAGGGTGCTG 



CGGGTCTTCC AGGAGGGGCG GGACATCCAC 



ACGGAGACCG CCAGCTGGAT 
CGGGCGGCCA AG AC CAT C AA 



GTTCGGCGTC CCCCGGGAGG CCGTGGACCC CCTGATGCGC 
CTTCGGGGTC CTCTACGGCA TGTCGGCCCA CCGCCTCTCC 



240 



300 



360 



420 



480 



540 



600 



660 



720 



780 



840 



900 



960 



1020 



1080 



1140 



1200 



1260 



1320 



1380 



1440 



1500 



1560 



1620 



1680 



1740 



1800 



1860 



1920 



1980 



2040 
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CAGGAGCTAG CTAGCCATCC CTTACGAGGA GGCCCAGGCC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATGAATTCGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC 
CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC 
CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA 
GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT 
GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC 
AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG 
GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG 
ACCGCCGACA AAGACCTTTA CCAGCTTCTT TCCGACCGCA 
GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG 
GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC 
GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA 
AACCTGGACC GGCTGAAGCC CGCCATCCGG GAGAAGATCC 
AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC 
AAAAGGCGGG AGCCCGACCG GGAGAGGCTT AGGGCCTTTC 
AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGT 
TGGCCGTGCC CCTGGAGGTG GAGGTGGGGA TAGGGGAGGA 
GA 

(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 0 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
ATGGAATTCG GGGATGCTGC CCCTCTTTGA GCCCAAGGGC 



TTCATTGA 



2088 



GGGTCCTCCT 
TCACCACCAG 
AGGCCCTCAA 
TCCGCCACGA 
CCCGGCAACT 
TCCCGGGCTA 
GCTACGAGGT 
TCCACGTCCT 
GCCTGAGGCC 
TTCCCGGGGT 
GCCTGGAAGC 
TGGCCCACAT 
CCCTGGAGGT 
TGGAGAGGCT 
CATGGAGGGG 
CTGGCTCTCC 



GGTGGACGGC 
CCGGGGGGAG 
GGAGGACGGG 
GGCCTACGGG 
CGCCCTCATC 
CGAGGCGGAC 
CCGCATCCTC 
CCACCCCGAG 
CGACCAGTGG 
CAAGGGCATC 
CCTCCTCAAG 
GGACGATCTG 
GGACTTCGCC 
TGAGTTTGGC 
GTGTATCCCC 
GCCAAGGAGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
962 



CGGGTCCTCC TGGTGGACGG 



60 
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CCACCACCTG GCCTACCGCA CCTTCCACGC CCTGAAGGGC 
GCCGGTGCAG GCGGTCTACG GCTTCGCCAA GAGCCTCCTC 
GGACGCGGTG ATCGTGGTCT TTGACGCCAA GGCCCCCTCC 
GGGGTACAAG GCGGGCCGGG CCCCCACGCC GGAGGACTTT 
CAAGGAGCTG GTGGACCTCC TGGGGCTGGC GCGCCTCGAG 
CGACGTCCTG GCCAGCCTGG CCAAGAAGGC GGAAAAGGAG 
CACCGCCGAC AAAGACCTTT ACCAGCTCCT TTCCGACCGC 
GGGGTACCTC ATCACCCCGG CCTGGCTTTG GGAAAAGTAC 
GGCCGACTAC CGGGCCCTGA CCGGGGACGA GTCCGACAAC 
CGGGGAGAAG ACGGCGAGGA AGCTTCTGGA GGAGTGGGGG 
GAACCTGGAC CGGCTGAAGC CCGCCATCCG GGAGAAGATC 
GAAGCTCTCC TGGGACCTGG CCAAGGTGCG CACCGACCTG 
CAAAAGGCGG GAGCCCGACC GGGAGAGGCT TAGGGCCTTT 
GAGCCTCCTC CACGAGTTCG GCCTTCTGGA AAGCCCCAAG 
CGAGGAGGGG TGGCTATTGG TGGCCCTGGA CTATAGCCAG 
CCACCTCTCC GGCGACGAGA ACCTGATCCG GGTCTTCCAG 
GGAGACCGCC AGCTGGATGT TCGGCGTCCC CCGGGAGGCC 
GGCGGCCAAG ACCATCAACT TCGGGGTCCT CTACGGCATG 
GGAGCTAGCC ATCCCTTACG AGGAGGCCCA GGCCTTCATT 
CCCCAAGGTG CGGGCCTGGA TTGAGAAGAC CCTGGAGGAG 
GGAGACCCTC TTCGGCCGCC GCCGCTACGT GCCAGACCTA 
GCGGGAGGCG GCCGAGCGCA TGGCCTTCAA CATGCCCGTC 
CATGAAGCTG GCTATGGTGA AGCTCTTCCC CAGGCTGGAG 
CCTTCAGGTC CACGACGAGC TGGTCCTCGA GGCCCCAAAA 
CCGGCTGGCC AAGGAGGTCA TGGAGGGGGT GTATCCCCTG 
GGTGGGGATA GGGGAGGACT GGCTCTCCGC CAAGGAGTGA 
(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



CTCACCACCA 
AAGGCCCTCA 
TTCCGCCACG 
CCCCGGCAAC 
GTCCCGGGCT 
GGCTACGAGG 
ATCCACGTCC 
GGCCTGAGGC 
CTTCCCGGGG 
AGCCTGGAAG 
CTGGCCCACA 
CCCCTGGAGG 
CTGGAGAGGC 
ATCCGCCGGG 
ATAGAGCTCA 
GAGGGGCGGG 
GTGGACCCCC 
TCGGCCCACC 
GAGCGCTACT 
GGCAGGAGGC 
GAGGCCCGGG 
CGGGGCACCG 
GAAATGGGGG 
GAGAGGGCGG 
GCCGTGCCCC 



GCCGGGGGGA 12 0 

AGGAGGACGG 18 0 

AGGCCTACGG 24 0 

TCGCCCTCAT 3 00 

ACGAGGCGGA 36 0 

TCCGCATCCT 42 0 

TCCACCCCGA 480 

CCGACCAGTG 54 0 

TCAAGGGCAT 60 0 

CCCTCCTCAA 660 

TGGACGATCT 72 0 
TGGACTTCGC 7 8 0 

TTGAGTTTGG 84 0 

CCTTCATCGC 9 00 

GGGTGCTGGC 960 

ACATCCACAC 102 0 

TGATGCGCCG 10 80 

GCCTCTCCCA 1140 

TTCAGAGCTT 12 0 0 

GGGGGT AC GT 1260 

TGAAGAGCGT 132 0 

CCGCCGACCT 13 80 
CCAGGATGCT 144 0 

AGGCCGTGGC 1500 
TGGAGGTGGA 1560 
1600 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CACGAATTCG GGGATGCTGC CCCTCTTTGA GCCCAA 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
GTGAGATCTA TCACTCCTTG GCGGAGAGCC AGTC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 
T AATACGAC T CACTATAGGG AGAC CGGAAT TCGAGCTCGC CCGGGCGAGC TCGAATTCCG 
TGTATTCTAT AGTGTCACCT AAATCGAATT C 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TAATACGACT CACTATAGGG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
GAATTCGATT TAGGTGACAC TATAGAA 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

31 

GTAATCATGG TCATAGCTGG TAGCTTGCTA C 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GGATCCTCTA GAGTCGACCT GCAGGCATGC CTACCTTGGT AG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 

GGATCCTCTA GAGTCGACCT GCAGGCATGC 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATGAATT CGG GGATGCTGCC CCTCTTTGAG CCCAAGGGCC GGGTCCTCCT GGTGGACGGC 
CACCACCTGG CCTACCGCAC CTTCCACGCC CTGAAGGGCC TCACCACCAG C CGGGGGGAG 
CCGGTGCAGG CGGTCTACGG CTTCGCCAAG AGCCTCCTCA AGGCCCTCAA GGAGGACGGG 
GACGCGGTGA TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGGG 
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GGGTACAAGG CGGGCCGGGC CCCCACGCCG GAGGACTTTC CCCGGCAACT CGCCCTCATC 
AAGGAGCTGG TGGACCTCCT GGGGCTGGCG CGCCTCGAGG TCCCGGGCTA CGAGGCGGAC 
GACGTCCTGG CCAGCCTGGC CAAGAAGGCG GAAAAGGAGG GCTACGAGGT CCGCATCCTC 
ACCGCCGACA AAGACCTTTA CCAGCTCCTT TCCGACCGCA TCCACGTCCT CCACCCCGAG 
GGGTACCTCA TCACCCCGGC CTGGCTTTGG GAAAAGTACG GCCTGAGGCC CGACCAGTGG 
GCCGACTACC GGGCCCTGAC CGGGGACGAG TCCGACAACC TTCCCGGGGT CAAGGGCATC 
GGGGAGAAGA CGGCGAGGAA GCTTCTGGAG GAGTGGGGGA GCCTGGAAGC CCTCCTCAAG 
AACCTGGACC GGCTGAAGCC CGCCATCCGG G AG AAG AT C C TGGCCCACAT GGACGATCTG 
AAGCTCTCCT GGGACCTGGC CAAGGTGCGC ACCGACCTGC CCCTGGAGGT GGACTTCGCC 
AAAAGGC GGG AGCCCGACCG GGAGAGGC TT AGGGCCTTTC TGGAGAGGCT TGAGTTTGGC 
AGCCTCCTCC ACGAGTTCGG CCTTCTGGAA AGCCCCAAGG CCCTGGAGGA GGCCCCCTGG 
CCCCCGCCGG AAGGGGC CTT CGTGGGCTTT GTGCTTTCCC GCAAGGAGCC CATGTGGGCC 
GATCTTCTGG CCCTGGCCGC CGCCAGGGGG GGCCGGGTCC ACCGGGCCCC CGAGC CTT AT 
AAAGCCCTCA GGGAC CTGAA GGAGGCGCGG GGGCTTCTCG CCAAAGACCT GAGCGTTCTG 
GCCCTGAGGG AAGGCCTTGG CCTCCCGCCC GGCGACGACC CCATGCTCCT CGCCTACCTC 
CTGGACCCTT CCAACACCAC CCCCGAGGGG GTGGCCCGGC GCTACGGCGG GGAGTGGACG 
GAGGAGGCGG GGGAGCGGGC CGCCCTTTCC GAGAGGCTCT TCGCCAACCT GTGGGGGAGG 
CTTGAGGGGG AGGAGAGGCT CCTTTGGCTT TACCGGGAGG TGGAGAGGCC CCTTTCCGCT 
GTCCTGGCCC ACATGGAGGC CACGGGGGTG CGCCTGGACG TGGCCTATCT CAGGGCCTTG 
T C C C TGGAGG TGGCCGGGGA GATCGCCCGC CTCGAGGCCG AGGTCTTCCG CCTGGCCGGC 
CACCCCTTCA ACCTCAACTC CCGGGACCAG CTGGAAAGGG TCCTCTTTGA CGAGCTAGGG 
CTTCCCGCCA TCGGCAAGAC GG AGAAG AC C GGCAAGCGCT CCACCAGCGC CGCCGTCCTG 
GAGGCCCTCC GCGAGGCCCA CCCCATCGTG GAGAAGATC C TGCAGTACCG GGAGCTCACC 
AAGCTGAAGA GCACCTACAT TGACCCCTTG CCGGACCTCA TCCACCCCAG GACGGGCCGC 
CTCCACACCC GCTTCAACCA GACGGCCACG GCCACGGGCA GGC T AAGT AG CTCCGATCCC 
AAC C T C C AG A ACATCCCCGT CCGCACCCCG CTTGGGCAGA GGATCCGCCG GGCCTTCATC 
GCCGAGGAGG GGTGGCTATT GGTGGCCCTG GACTATAGCC AGATAGAGCT CAGGGTGCTG 
GCCCACCTCT CCGGCGACGA GAACCTGATC CGGGTCTTCC AGGAGGGGCG GGACATCCAC 
ACGGAGACCG CCAGCTGGAT GTTCGGCGTC CCCCGGGAGG CCGTGGACCC CCTGATGCGC 
CGGGCGGCCA AGACCATCAA CTTCGGGGTC CTCTACGGCA TGTCGGCCCA CCGCCTCTCC 
CAGGAGCTAG CCATCCCTTA CGAGGAGGC C CAGGCCTTCA TTGAGCGCTA CTTTCAGAGC 
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TTCCCCAAGG TGCGGGCCTG GATTGAGAAG AC C C TGG AGG AGGGCAGGAG GCGGGGGTAC 
GTGGAGACCC TCTTCGGCCG CCGCCGCTAC GTGCCAGACC TAGAGGCCCG GGTGAAGAGC 
GTGCGGGAGG CGGCCGAGCG CATGGCCTTC AACATGCCCG TCCGGGGCAC CGCCGCCGAC 
CTCATGAAGC TGGCTATGGT GAAGCTCTTC CCCAGGCTGG AGGAAATGGG GGCCAGGATG 
CTCCTTCAGG TCCACGACGA GCTGGTCCTC GAGGCCCCAA AAGAGAGGGC GGAGGCCGTG 
GCCCGGCTGG CCAAGGAGGT CATGGAGGGG GTGTATCCCC TGGCCGTGCC CCTGGAGGTG 
GAGGTGGGGA TAGGGGAGGA CTGGCTCTCC GCCAAGGAGT GA 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
GATTTAGGTG ACACTATAG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CGGACGAACA AGCGAGACAG CGACACAGGT AC C AC AT GGT ACAAGAGGCA AG AG AG AC G A 

CACAGCAGAA AC 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GTTTCTGCTG TGTCGTCTCT CTTGCCTCTT GT AC C ATGTG GTACCTGTGT CGCTGTCTCG 

CTTGTTCGTC 

(2) INFORMATION FOR SEQ ID NO: 25: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

20 

GACGAACAAG CGAGACAGCG 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

24 

GTTTCTGCTG TGTCGTCTCT CTTG 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CCTCTTGTAC CATGTGGTAC CTGTGTCGCT GTCTCGCTTG TTCGTC 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
ACACAGGTAC CACATGGTAC AAGAGGCAAG AGAGACGACA CAGCAGAAAC 5 0 

(2) INFORMATION FOR SEQ ID NO : 2 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg He Asn Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 969 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.30: 
ATGGCTAGCA TGACTGGTGG ACAGCAAATG GGTCGGATCA ATTCGGGGAT GCTGCCCCTC 
TTTGAGCCCA AGGGC CGGGT CCTCCTGGTG GACGGCCACC ACCTGGCCTA CCGCACCTTC 
CACGCCCTGA AGGGCCTCAC CACCAGCCGG GGGGAGCCGG TGCAGGCGGT CTACGGCTTC 
GCCAAGAGCC TCCTCAAGGC CCTCAAGGAG GACGGGGACG CGGTGATCGT GGTCTTTGAC 
GCCAAGGCCC CCTCCTTCCG CCACGAGGCC TACGGGGGGT ACAAGGCGGG CCGGGCCCCC 
ACGCCGGAGG ACTTTCCCCG GCAACTCGCC CTCATCAAGG AGCTGGTGGA CCTCCTGGGG 
CTGGCGCGCC TCGAGGTCCC GGGCTACGAG GCGGACGACG TCCTGGCCAG CCTGGCCAAG 
AAGGCGGAAA AGGAGGGCTA CGAGGTCCGC ATCCTCACCG CCGACAAAGA CCTTTACCAG 
CTTCTTTCCG ACCGCATCCA CGTCCTCCAC CCCGAGGGGT ACCTCATCAC CCCGGCCTGG 
CTTTGGGAAA AGTACGGCCT GAGGCCCGAC CAGTGGGCCG ACTACCGGGC CCTGACCGGG 
GACGAGTCCG ACAACCTTCC CGGGGTCAAG GGCATCGGGG AGAAGACGGC GAGGAAGCTT 
CTGGAGGAGT GGGGGAGCCT GGAAGCCCTC CTCAAGAACC TGGACCGGCT GAAGCCCGCC 
ATCCGGGAGA AGATCCTGGC CCACATGGAC GATC TGAAGC TCTCCTGGGA CCTGGCCAAG 
GTGCGCACCG ACCTGCCCCT GGAGGTGGAC TTCGCCAAAA GGCGGGAGCC CGACCGGGAG 
AGGC TTAGGG CCTTTCTGGA GAGGCTTGAG TTTGGCAGCC TCCTCCACGA GTTCGGCCTT 
CTGGAAAGCC CCAAGTCATG GAGGGGGTGT ATCCCCTGGC CGTGCCCCTG GAGGTGGAGG 
TGGGGATAG 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
ATGGCTAGCA TGAC TGGTGG ACAGCAAATG GGTCGGATCA 
TTTGAGCCCA AGGGCCGGGT CCTCCTGGTG GACGGCCACC 
CACGCCCTGA AGGGCCTCAC CACCAGCCGG GGGGAGCCGG 
GCCAAGAGCC TCCTCAAGGC CCTCAAGGAG GACGGGGACG 
GCCAAGGCCC CCTCCTTCCG CCACGAGGCC TACGGGGGGT 
ACGCCGGAGG ACTTTCCCCG GCAACTCGCC CTCATCAAGG 
CTGGCGCGCC TCGAGGTCCC GGGCTACGAG GCGGACGACG 
AAGGCGGAAA AGGAGGGCTA CGAGGTCCGC ATCCTCACCG 
CTTCTTTCCG ACCGCATCCA CGTCCTCCAC CCCGAGGGGT 
CTTTGGGAAA AGTACGGCCT GAGGCCCGAC CAGTGGGCCG 
GACGAGTCCG ACAACCTTCC CGGGGTCAAG GGCATCGGGG 
CTGGAGGAGT GGGGGAGCCT GGAAGCCCTC CTCAAGAACC 
ATCCGGGAGA AGATCCTGGC CCACATGGAC GATCTGAAGC 
GTGCGCACCG ACCTGCCCCT GGAGGTGGAC TTCGCCAAAA 
AGGCTTAGGG CCTTTCTGGA GAGGCTTGAG TTTGGCAGCC 
CTGGAAAGCC CCAAGGCCGC ACTCGAGCAC CACCACCACC 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CGCCAGGGTT TTCCCAGTCA CGACGTTGTA AAACGACGGC 
CACTATAGGG CGAATTCGAG CTCGGTACCC GGGGATCCTC 
GCAAGCTTGA GTATTCTATA GTGTCACCTA AATAGCTTGG 
TTTCCTGTGT GAAATTGTTA TCCGCT 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TTCTGGGTTC TCTGCTCTCT GGT CGCTGTC TCGCTTGTTC GTC 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GCTGTCTCGC TTGTTCGTC 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GACGAACAAG CGAGACAGCG 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:36: 
TTCTGGGTTC TCTGCTCTCT GGTC 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GACGAACAAG CGAGACAGCG ACCAGAGAGC AGAGAACCCA GAA 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
ACCAGAGAGC AG AGAAC CCA GAA 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
AACAGCTATG ACCATGATTA C 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC C C ATTGGAC A TAACCGGGAA 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 
GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTAGACA TAACCGGGAA 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 
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GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
CACCGTCCTC TTCAAGAAG 
(2) INFORMATION FOR SEQ ID NO-.43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CTGAATCTTG TAGATAGCTA 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
GCCTTATTTT ACTTTAAAAA TTTTCAAATG TTTCTTTTAT ACACAATATG 
TGAATAACCT TTTCCTCTGC AGTATTTTTG AGCAGTGGCT CCGAAGGCAC 
AAGAAGTTTA TCCAGAAGCC AATGC AC CCA TTAGACATAA CCGGGAATCC 
CTTTTATACC ACTGTACAGA AATGGTGATT TCTTTATTTC AT C C AAAGAT 
ACTATAGCTA TCTACAAGAT TCAGGTAAAG TTTACTTTCT TTCAGAGGAA 
TAGTGTTACC AATTTATTTT GAGATAACAC AAAACTTTA 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GCCTTATTTT ACTTTAAAAA T 
(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
TAAAGTTTTG TGTTATCTCA 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC C C AT TGGAC A TAACCGGGAA 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 
GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
CTGAATCTTG TAGATAGCTA TAGTCATAGC CCAGATCTTT GGATGAAATA AAGAAAT C AC 
CATTTCTGTA CAGTGGTATA AAAGGAACCA TGTAGGATTC CCGGTTATGT CCAATGGGTG 
CATTGGCTTC TGGATAAACT TCTTGAAGAG GACGGTG 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
AGC GGAT AAC AATTTCACAC AGGAAACAGC TATGACCATG ATTACGCCAA GCTATTTAGG 
TGACACTATA GAATACTCAA GCTTGCATGC CTGCAGGTCG ACTCTAGAGG ATCCCCGGGT 
ACCGAGCTCG AATTCGCCCT ATAGTGAGTC GTATTAGGAT CCGTG 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CGC C AGGGTT TTCCCAGTCA CGACGTTGTA AAACGACGGC CAGTGAATTG TAATACGACT 
CACTATAGGG CGAATTCGAG CTCGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT 
GCAAGCTTGA GTATTCTATA GTGTCACCTA AATAGCTTGG CGTAATCATG GTCATAGCTG 
TTTCCTGTGT GAAATTGTTA TCCGCT 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
AGCGGATAAC AATTTCACAC AGGA 
(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
CACGGATCCT AATACGACTC ACTATAGGG 
(2) INFORMATION FOR SEQ ID NO: 53: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

CGCCAGGGTT TTCCCAGTCA CGAC 

(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG 
GATCTGGGCT ATGAC TAT AG CTATCTACAA GATTCAG 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG 
GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 
(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC 
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CCATTAGACA TAACCGGGAA 
ATTTCTTTAT TTC ATC C AAA 
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TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 
GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAGACC CAGACTCTTT TCAAGACTAC 
ATTAAGTCCT ATTTGGAACA AGCGAGTCGG ATCTGGTCAT GGCTCCTTGG GGCGGCGATG 
GTAGGGGCCG TCCTCACTGC CCTGCTGGCA GGGCTTGTGA GCTTGCTGTG TCGTCACAAG 
AGAAAGCAGC TTCCTGAAGA AAAGCAGCCA CTCCTCATGG AGAAAGAGGA TTACCACAGC 
TTGTATCAGA GCCATTTA 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC CCATTGGACA TAACCAGGAA 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 
GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAGACC CAGACTCTTT TCAAGACTAC 
ATTAAGTCCT ATTTGGAACA AGCGAGTCGG ATCTGGTCAT GGCTCCTTGG GGCGGCGATG 
GTAGGGGCCG TCCTCACTGC CCTGCTGGCA GGGCTTGTGA GCTTGCTGTG TCGTCACAAG 
AGAAAGCAGC TTCCTGAAGA AAAGCAGCCA CTCCTCATGG AGAAAGAGGA TTACCACAGC 
TTGTATCAGA GCCATTTA 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
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GCAAGTTTGG 


CTTTTGGGGA 


CCAAACTGCA 


CAGAGAGACG 


ACTCTTGGTG 


AGAAGAAACA 
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TCTTCGATTT 


GAGTGCCCCA 


GAGAAGGACA 


AATTTTTTGC 


CTACCTCACT 


TTAGCAAAGC 
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AT AC CAT C AG 


CTCAGACTAT 


GTCATCCCCA 


TAGGGACCTA 


TGGCCAAATG 


AAAAATGGAT 


180 


CAACACCCAT 


GTTTAACGAC 


ATCAATATTT 


ATGACCTCTT 


TGTCTGGATG 


CATTATTATG 


240 


TGTCAATGGA 


TGCACTGCTT 


GGGGGATATG 


AAATCTGGAG 


AGACATTGAT 


TTTGCCCATG 


300 


AAGCACCAGC 


TTTTCTGCCT 


TGGCATAGAC 


TCTTCTTGTT 


GCGGTGGGAA 


CAAGAAATCC 


360 
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AGAAGCTGAC AGGAGATGAA AACTTCACTA TTCCATATTG 
AGTGTGACAT TTGCACAGAT GAGTACATGG GAGGTCAGCA 
TCAGCCCAGC ATCATTCTTC TCCTCTTGGC AGATTGTCTG 
ACAGCCATCA GTCTTTATGC AATGGAACGC CCGAGGGACC 
ACCATGACAA ATCCAGAACC CCAAGGCTCC CCTCTTCAGC 
GTTTGACCCA ATATGAATCT GGTTCCATGG ATAAAGCTGC 
CACTGGAAGG ATTTGCTAGT CCACTTACTG GGATAGCGGA 
ACAATGCCTT GCACATCTAT ATGAATGGAA CAATGTCCCA 
ATCCTATCTT CCTTCTTCAC CATGCATTTG TTGACAGTAT 
GGCACCGTCC TCTTCAAGAA GTTTATCCAG AAGCCAATGC 
AATCCTACAT GGTTCCTTTT ATACCACTGT ACAGAAATGG 
AAGATCTGGG CTATGACTAT AGCTATCTAC AAGATTCAG 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 
GCAAGTTTGG CTTTTGGGGA CCAAACTGCA CAGAGAGACG 
TCTTCGATTT GAGTGCCCCA GAGAAGGACA AATTTTTTGC 
AT AC CATC AG CTCAGACTAT GTCATCCCCA TAGGGACCTA 
CAACACCCAT GTTTAACGAC ATCAATATTT ATGACCTCTT 
TGTCAATGGA TGCACTGCTT GGGGGATATG AAATCTGGAG 
AAGCACCAGC TTTTCTGCCT TGGCATAGAC TCTTCTTGTT 
AGAAGCTGAC AGGAGATGAA AACTTCACTA TTCCATATTG 
AGTGTGACAT TTGCACAGAT GAGTACATGG GAGGTCAGCA 
TCAGCCCAGC ATCATTCTTC TCCTCTTGGC AGATTGTCTG 
ACAGCCATCA GTCTTTATGC AATGGAACGC CCGAGGGACC 
ACCATGACAA ATCCAGAACC CCAAGGCTCC CCTCTTCAGC 
GTTTGACCCA ATATGAATCT GGTTCCATGG ATAAAGCTGC 
CACTGGAAGG ATTTGCTAGT CCACTTACTG GGATAGCGGA 



GGACTGGCGG 
CCCCACAAAT 
TAGCCGATTG 
TTTACGGCGT 
TGATGTAGAA 
CAATTTCAGC 
TGCCTCTCAA 
GGTACAGGGA 
TTTTGAGCAG 
ACCCATTGGA 
TGATTTCTTT 



GATGCAGAAA 

CCTAACTTAC 

GAGGAGTACA 

AATCCTGGAA 

TTTTGCCTGA 

TTTAGAAATA 

AGCAGCATGC 

TCTGCCAACG 

TGGCTCCGAA 

CATAACCGGG 

ATTTCATCCA 



ACTCTTGGTG 

CTACCTCACT 

TGGCCAAATG 

TGTCTGGATG 

AGACATTGAT 

GCGGTGGGAA 

GGACTGGCGG 

CCCCACAAAT 

TAGCCGATTG 

TTTACGGCGT 

TGATGTAGAA 

CAATTTCAGC 

TGCCTCTCAA 



AGAAGAAACA 

TTAGCAAAGC 

AAAAATGGAT 

CATTATTATG 

TTTGCCCATG 

CAAGAAATCC 

GATGCAGAAA 

CCTAACTTAC 

GAGGAGTACA 

AATCCTGGAA 

TTTTGCCTGA 

TTTAGAAATA 

AGCAGCATGC 
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ACAATGCCTT GCACATCTAT ATGAATGGAA CAATGTCCCA GGTACAGGGA TCTGCCAACG 
ATCCTATCTT CCTTCTTCAC CATGCATTTG TTGACAGTAT TTTTGAGCAG TGGCTCCGAA 
GGCACCGTCC TCTTCAAGAA GTTTATCCAG AAGCCAATGC ACCCATTGGA CATAACCAGG 
AATCCTACAT GGTTCCTTTT ATACCACTGT ACAGAAATGG TGATTTCTTT ATTTCATCCA 
AAGATCTGGG CTATGACTAT AGCTATCTAC AAGATTCAG 
(2) INFORMATION FOR SEQ ID NO:60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1587 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
ATGCTCCTGG CTGTTTTGTA CTGCCTGCTG TGGAGTTTCC AGACCTCCGC TGGCCATTTC 
CCTAGAGCCT GTGTCTCCTC TAAGAACCTG ATGGAGAAGG AATGCTGTCC ACCGTGGAGC 
GGGGACAGGA GTCCCTGTGG CCAGCTTTCA GGCAGAGGTT CCTGTCAGAA TATCCTTCTG 
TCCAATGCAC CACTTGGGCC TCAATTTCCC TTCACAGGGG TGGATGACCG GGAGTCGTGG 
CCTTCCGTCT TTTATAATAG GACCTGCCAG TGCTCTGGCA ACTTCATGGG ATTCAACTGT 
GGAAACTGCA AGTTTGGCTT TTGGGGACCA AACTGCACAG AGAGACGACT CTTGGTGAGA 
AGAAACATCT TCGATTTGAG TGCCCCAGAG AAGGACAAAT TTTTTGCCTA CCTCACTTTA 
GCAAAGCATA CCATCAGCTC AGACTATGTC ATCCCCATAG GGACCTATGG C C AAATGAAA 
AATGGATCAA CACCCATGTT TAACGACATC AATATTTATG ACCTCTTTGT CTGGATGCAT 
TATTATGTGT CAATGGATGC ACTGCTTGGG GGATATGAAA TCTGGAGAGA CATTGATTTT 
GCCCATGAAG CACCAGCTTT TCTGCCTTGG CATAGACTCT TCTTGTTGCG GTGGGAACAA 
GAAATCCAGA AGCTGACAGG AGATGAAAAC TTCACTATTC CATATTGGGA CTGGCGGGAT 
GCAGAAAAGT GTGACATTTG CACAGATGAG TACATGGGAG GTCAGCACCC CACAAATCCT 
AACTTACTCA GCCCAGCATC ATTCTTCTCC TCTTGGCAGA TTGTCTGTAG CCGATTGGAG 
GAGTACAACA GCCATCAGTC TTTATGCAAT GGAACGCCCG AGGGACCTTT ACGGCGTAAT 
CCTGGAAACC ATGACAAATC CAGAACCCCA AGGCTCCCCT CTTCAGCTGA TGTAGAATTT 
TGCCTGAGTT TGACCCAATA TGAATCTGGT TCCATGGATA AAGCTGCCAA TTTCAGCTTT 
AGAAATACAC TGGAAGGATT TGCTAGTCCA CTTACTGGGA TAGCGGATGC CTCTCAAAGC 
AGCATGCACA ATGCCTTGCA CATCTATATG AATGGAACAA TGTCCCAGGT ACAGGGATCT 
GCCAACGATC CTATCTTCCT TCTTCACCAT GCATTTGTTG ACAGTATTTT TGAGCAGTGG 
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CTCCGAAGGC ACCGTCCTCT TCAAGAAGTT TATCCAGAAG 
AACCGGGAAT CCTACATGGT TCCTTTTATA CCACTGTACA 
TCATCCAAAG ATCTGGGCTA TGACTATAGC TATCTACAAG 
CAAGACTACA TTAAGTCCTA TTTGGAACAA GCGAGTCGGA 
GCGGCGATGG TAGGGGCCGT CCTCACTGCC CTGCTGGCAG 
CGTCACAAGA GAAAGCAGCT TCCTGAAGAA AAGCAGCCAC 
TACCACAGCT TGTATCAGAG CCATTTA 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 
ATGCTCCTGG CTGTTTTGTA CTGCCTGCTG TGGAGTTTCC 
CCTAGAGCCT GTGTCTCCTC TAAGAACCTG ATGGAGAAGG 
GGGGACAGGA GTCCCTGTGG CCAGCTTTCA GGCAGAGGTT 
TCCAATGCAC CACTTGGGCC TCAATTTCCC TTCACAGGGG 
CCTTCCGTCT TTTATAATAG GACCTGCCAG TGCTCTGGCA 
GGAAACTGCA AGTTTGGCTT TTGGGGAC C A AACTGCACAG 
AGAAACATCT TCGATTTGAG TGCCCCAGAG AAGGACAAAT 
GCAAAGCATA CCATCAGCTC AGACTATGTC AT C C C CAT AG 
AATGGATCAA CACCCATGTT T AACGAC AT C AATATTTATG 
TATTATGTGT CAATGGATGC ACTGCTTGGG GGATATGAAA 
GCCCATGAAG C AC C AGCTTT TCTGCCTTGG CATAGACTCT 
GAAATCCAGA AGCTGACAGG AGATGAAAAC TTCACTATTC 
GCAGAAAAGT GTGACATTTG CACAGATGAG TACATGGGAG 
AACTTACTCA GCCCAGCATC ATTCTTCTCC TCTTGGCAGA 
GAGTACAACA GCCATCAGTC TTTATGCAAT GGAACGCCCG 
CCTGGAAACC ATGAC AAAT C CAGAACCCCA AGGCTCCCCT 
TGCCTGAGTT TGACCCAATA TGAATCTGGT TCCATGGATA 
AGAAATACAC TGGAAGGATT TGCTAGTCCA CTTACTGGGA 



CCAATGCACC 
GAAATGGTGA 
ATTCAGACCC 
TCTGGTCATG 
GGCTTGTGAG 
TCCTCATGGA 



CATTGGACAT 
TTTCTTTATT 
AGACTCTTTT 
GCTCCTTGGG 
CTTGCTGTGT 
GAAAGAGGAT 



AGACCTCCGC 
AATGCTGTCC 
CCTGTCAGAA 
TGGATGACCG 
ACTTCATGGG 
AGAGACGACT 
TTTTTGCCTA 
GGACCTATGG 
ACCTCTTTGT 
TCTGGAGAGA 
TCTTGTTGCG 
CATATTGGGA 
GTCAGCACCC 
TTGTCTGTAG 
AGGGACCTTT 
CTTCAGCTGA 
AAGCTGCCAA 
TAGCGGATGC 



TGGCCATTTC 

ACCGTGGAGC 

TATCCTTCTG 

GGAGTCGTGG 

ATTCAACTGT 

CTTGGTGAGA 

CCTCACTTTA 

C C AAATG AAA 

CTGGATGCAT 

CATTGATTTT 

GTGGGAACAA 

CTGGCGGGAT 

CACAAATCCT 

CCGATTGGAG 

ACGGCGTAAT 

TGTAGAATTT 

TTTCAGCTTT 

CTCTCAAAGC 
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AGCATGCACA 


ATGCCTTGCA 


CATCTATATG 


AATGGAACAA 


TGTCCCAGGT 


AC A(j(jLjA 1L1 
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GC C AACGATC 


CTATCTTCCT 


TCTTCACCAT 


GCATTTGTTG 


ACAGTATTTT 


I CjjACjL- Pi\J 1 ^J^J 


1200 


CTCCGAAGGC 


ACCGTCCTCT 
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TATCCAGAAG 


CCAATGCACC 
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AACCAGGAAT 


CCTACATGGT 


TCCTTTTATA 


CCACTGTACA 


GAAATGGTGA 
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TCATCCAAAG 


ATCTGGGCTA 


TGACTATAGC 
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ATTCAGACCC 


AGACTCTTTT 
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CAAGACTACA 


TTAAGTCCTA 


TTTGGAACAA 


GCGAGTCGGA 


TCTGGTCATG 
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GCGGCGATGG 


TAGGGGCCGT 


CCTCACTGCC 


CTGCTGGCAG 


GGCTTGTGAG 


CTTGCTGTGT 


1500 


CGTCACAAGA 


GAAAGCAGCT 


TCCTGAAGAA 


AAGCAGCCAC 


TCCTCATGGA 


GAAAGAGGAT 
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TACCACAGCT 


TGTATCAGAG 


CCATTTA 
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(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 
TAAATGGCTC TGATACAAGC T 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GCAAGTTTGG CTTTTGGGGA 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
ATGCTCCTGG CTGTTTTGTA CTG 
(2) INFORMATION FOR SEQ ID NO: 65: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CTGAATCTTG TAGATAGCTA TAGTCATAGC CCAGATCTTT GGATGAAATA AAGAAAT C AC 
CATTTCTGTA CAGTGGTATA AAAGGAACCA TGTAGGATTC CCGGTTATGT CTAATGGGTG 
CATTGGCTTC TGGATAAACT TCTTGAAGAG GACGGTG 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 
CTGAATCTTG TAGATAGCTA TAGTCATAGC CCAGATCTTT GGATGAAATA AAGAAATCAC 
CATTTCTGTA CAGTGGTATA AAAGGAACCA TGTAGGATTC CTGGTTATGT CCAATGGGTG 
CATTGGCTTC TGGATAAACT TCTTGAAGAG GACGGTG 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
GGTTGGCCAA TCTACTCCCA GG 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
GCTCACTCAG TGTGGCAAAG 

- 346 - 



60 
120 
157 



60 
120 
157 



22 



20 




(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC 
AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA 
CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGAAGT 
GGCAAGGTGA ACGTGGATGA AGTTGGTGGT GAGGCCCTGG 
CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG 
GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTGGTCTAT 
GGTGGTCTAC CCTTGGACCC AGAGGTTCTT TGAGTCCTTT 
TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG 
TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT 
(2) INFORMATION FOR SEQ ID NO: 7 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 
GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC 
AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA 
CAAACAGACA CCATGGTGCA TCTGACTCCT GAGGAGGTCT 
CAAGGTGAAC GTGGATGAAG TTGGTGGTGA GGCCCTGGGC 
AGACAGGTTT AAGGAGACCA ATAGAAACTG GGCATGTGGA 
TTCTGATAGG CACTGACTCT CTCTGCCTAT TGGTCTATTT 
TGGTCTACCC TTGGACCCAG AGGTTCTTTG AGTCCTTTGG 
CTGTTATGGG CAACCCTAAG GTGAAGGCTC ATGGCAAGAA 
ATGGCCTGGC TCACCTGGAC AACCTCAAGG GCACCTTTGC 
(2) INFORMATION FOR SEQ ID NO: 71: 



CAGGGCTGGG 
ACTGTGTTCA 
CTGCCGTTAC 
GCAGGTTGGT 
GAGACAGAGA 
TTTCCCACCC 
GGGGATCTGT 
AAAGTGCTCG 
GCCACACTGA 



CATAAAAGTC 

CTAGCAACCT 

TGCCCTGTGG 

ATCAAGGTTA 

AGACTCTTGG 

TTAGGCTGCT 

CCACTCCTGA 

GTGCCTTTAG 

GTGAGC 



CAGGGCTGGG 
ACTGTGTTCA 
GCCGTTACTG 
AGGTTGGTAT 
GACAGAGAAG 
TCCCACCCTT 
GGATCTGTCC 
AGTGCTCGGT 
CACACTGAGT 



CATAAAAGTC 
CTAGCAACCT 
CCCTGTGGGG 
CAAGGTTACA 
ACTCTTGGGT 
AGGCTGCTGG 
ACTCCTGATG 
GCCTTTAGTG 
GAGC 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 
AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 
CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 
GGCAAGGTGA ACGTGGATGA AGTTGGTGGT GAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 
CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 
GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTGGTCTAT TTTCCCACCC TTAGGCTGCT 
GGTGGTCTAC CCTTGGACCT AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 
TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 
TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT G C C AC AC TG A GTGAGC 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 
AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 
CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 
GGCAAGGTGA ACGTGGATGA AGTTGGAGGT GAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 
CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 
GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTGGTCTAT TTTCCCACCC TTAGGCTGCT 
GGTGGTCTAC CCTTGGACCC AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 
TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 
TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 
(2) INFORMATION FOR SEQ ID NO: 73: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 64 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : RNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 
GAAUACUCAA GCUUGCAUGC CUGCAGGUCG ACUCUAGAGG AUCCCCGGGU ACCGAGCUCG 60 
AAUU 

(2) INFORMATION FOR SEQ ID NO:74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
GGCTGACAAG AAGGAAACTC 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CCAGGCGGCG GCTAGGAGAG ATGGG 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 

GGCTGACAAG AAGGAAACTC GCTGAGACAG CAGGGACTTT C C AC AAGGGG ATGTTACGGG 6 0 

GAGGTACTGG GGAGGAGCCG GTCGGGAACG CCCACTCTCT TGATGTATAA ATATCACTGC 12 0 

ATTTCGCTCT GTATTCAGTC GCT CTGCGGA GAGGCTGGCA GATTGAGCCC TGGGAGGTTC 180 

TCTCCAGCAC TAGCAGGTAG AGCCTGGGTG TTCCCTGCTA GACTCTCACC AGCACTTGGC 24 0 

CGGTGCTGGG CAGAGTGGCT CCACGCTTGC TTGCTTAAAG ACCTCTTCAA TAAAGCTGCC 30 0 
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ATTTTAGAAG TAGGCCAGTG TGTGTTCCCA TCTCTCCTAG CCGCCGCCTG G 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
GGCTGACAAG AAGGAAACTC GCTGAGATAG CAGGGACTTT C C AC AAGGGG ATGTTATGGG 
GAGGAGCCGG TCGGGAACAC CCACTTTCTT GATGTATAAA TATCACTGCA TTTCGCTCTG 
TATTCAGTCG CTCTGCGGAG AGGCTGGCAG ATTGAGCCCT GGGAGGTTCT CTCCAGCACT 
AG C AGGT AG A GCCTGGGTGT TCCCTGCTAG ACTCTCACCA GCACTTAGCC AGTGCTGGGC 
AGAGTGGCTC CACGCTTGCT TGCTTAAAGA CCTCTTCAAT AAAGCTGC C A TTTTAGAAGT 
AAGCCAGTGT GTGTTCCCAT CTCTCCTAGC CGCCGCCTGG 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
GGCTGACAAG AAGGAAACTC GCTGAGATAG CAGGGACTTT CC AC AAGGGG ATGTTATGGG 
GAGGAGCCGG TCGGGAACAC CCACTTTCTT GGTGTATAAA TATCACTGCA TTTCGCTCTG 
TATTCAGTCG CTCTGCGGAG AGGCTGGCAG ATTGAGCCCT GGGAGGTTCT CTCCAGCACT 
AGCAGGTAGA GCCTGGGTGT TCCCTGCTAG ACTCTCACCA GCACTTGGCC AGTGCTGGGC 
AGAGTGGCTC CACGCTTGCT TGCTTAAAGA CCTCTTCAAT AAAGCTGCCA TTTTAGAAGT 
AAGCCAGTGT GTGTTCCCAT CTCTCCTAGC CGCCGCCTGG 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
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GGCTGACAAG AAGGAAACTC GCTGAGACAG CAGGGACTTT C C AC AAGGGG ATGTTACGGG 
GAGGTACTGG GGAGGAGCCG GTCGGGAACG CCCCCTCTCT TGATGTATAA ATATCACTGC 
ATTTCGCTCT GTATTCAGTC GCTCTGCGGA GAGGCTGGCA GATTGAGCCC TGGGAGGTTC 
TCTCCAGCAC TAGCAGGTAG AGCCTGGGTG TTCCCTGCTA GACTCTCACC AGCACTTGGC 
CGGTGCTGGG CAGAGTGGCT CCACGCTTGC TTGCTTAAAG ACCTCTTCAA TAAAGCTGCC 
ATTTTAGAAG TAGGCTAGTG TGTGTTCCCA TCTCTCCTAG CCGCCGCCTG G 
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
GGCTGACAAG AAGGAAACTC GCTGAAACAG CAGGGACTTT CCACAAGGGG ATGTTACGGG 
GAGGTACTGG GAAGGAGCCG GTCGGGAACG CCCACTTTCT TGATGTATAA ATATCACTGC 
ATTTCGCTCT GTATTCAGTC GCTCTGCGGA GAGGCTGGCA GATTGAGCCC TGGGAGGTTC 
TCTCCAGCAC TAGCAGGTAG AGCCTGGGTG TTCCCTGCTA GACTCTCACC AGCACTTGGC 
CGGTGCTGGG CAGAGTGACT CCACGCTTGC TTGCTTAAAG CCCTCTTCAA TAAAGCTGCC 
ATTTTAGAAG TAAGCTAGTG TGTGTTCCCA TCTCTCCTAG CCGCCGCCTG G 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
GGCTGACAAG AAGGAAACTC GCTGAGACAG CAGGGACTTT CCACAAGGGG ATGTTACGGA 
GAGGTACTGG GGAGGAGCCG GTCGGGAACG CCCACTCTCT TGATGTATAA ATATCACTGC 
ATTTCGCTCT GTATTCAGTC GCTCTGCGGA GAGGCTGGCA GATTGAGCCC TAGGAGGTTC 
TCTCCAGCAC TAGCAGGTAG AGCCTGAGTG TTCCCTGCTA AACTCTCACC AGCACTTGGC 
CGGTGCTGGG CAGAGCGGCT CCACGCTTGC TTGCTTAAAG ACCTCTTCAA TAAAGCTGCC 
ATTTTAGAAG TAGGCTAGTG TGTGTTCCCA TCTCTCCTAG CCGCCGCCTG G 
(2) INFORMATION FOR SEQ ID NO:82: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 6 0 

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 12 0 

CAAACAGACA C C ATGGTGC A TCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 180 

GGCAAGGTGA AC GTGG ATG A AGTTGGTGGT AAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 24 0 

CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 3 00 

GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTGGT C TAT TTTCCCACCC TTAGGCTGCT 3 60 

GGTGGTCTAC CCTTGGACCC AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 420 

TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 4 80 

TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 53 6 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

GGTTGGCCAA TCTACTCCCA GGAGCAGGGA GGGCAGGAGC CAGGGCTGGG CATAAAAGTC 6 0 

AGGGCAGAGC CATCTATTGC TTACATTTGC TTCTGACACA ACTGTGTTCA CTAGCAACCT 12 0 

CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGAAGT CTGCCGTTAC TGCCCTGTGG 180 

GGCAAGGTGA ACGTGGATGA AGTTGGTGGT GAGGCCCTGG GCAGGTTGGT ATCAAGGTTA 24 0 

CAAGACAGGT TTAAGGAGAC CAATAGAAAC TGGGCATGTG GAGACAGAGA AGACTCTTGG 300 

GTTTCTGATA GGCACTGACT CTCTCTGCCT ATTAGTCTAT TTTCCCACCC TTAGGCTGCT 360 

GGTGGTCTAC CCTTGGACCC AGAGGTTCTT TGAGTCCTTT GGGGATCTGT CCACTCCTGA 42 0 

TGCTGTTATG GGCAACCCTA AGGTGAAGGC TCATGGCAAG AAAGTGCTCG GTGCCTTTAG 480 

TGATGGCCTG GCTCACCTGG ACAACCTCAA GGGCACCTTT GCCACACTGA GTGAGC 53 6 

(2) INFORMATION FOR SEQ ID NO : 84 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
CACCGTCCTC TTCAAGAAGT TTATCCAGAA GCCAATGCAC C C ATTGG AC A TAACCAGGAA 
TCCTACATGG TTCCTTTTAT ACCACTGTAC AGAAATGGTG ATTTCTTTAT TTCATCCAAA 
GATCTGGGCT ATGACTATAG CTATCTACAA GATTCAG 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
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Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1 5 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Glv Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
1 -- 40 45 
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Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 HO 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 l feu 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 I 75 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 
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Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Gly Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 
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He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arq Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arq Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
3 740 745 750 

Pro Val Arg Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu 

(2) INFORMATION FOR SEQ ID NO : 86 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 548 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 

25 30 
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Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 



Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 

90 95 



Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 HO 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 I 25 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 I 70 1 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
7 245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 
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Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Gly Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Ala Cys Lys Leu Gly Thr Gly Arg Arg 
530 535 540 

Phe Thr Thr Ser 
545 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
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Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 



10 



15 



Leu Val Asp Gly His His 



Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 



20 



25 



30 



Glv Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
1 - ~ 45 



35 



40 



Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 



50 



55 



60 



Val Val Phe Asp Ala Ly 



Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 



65 



70 



75 



80 



Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 



85 



90 



Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 



100 
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110 



Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Ly 



115 



120 



125 



Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 



130 



135 



140 



Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 



145 



150 
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Glv Tvr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
1 175 



165 



170 



Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
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Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 



195 



200 



205 



Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 



210 



215 



220 



Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 

240 



225 



230 



235 



Lvs Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
y 255 



245 



250 



Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 



260 
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Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 



275 



280 



285 



Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 



290 
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300 



Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 



310 



315 



320 



Gly 
305 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
r 335 



325 



330 
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Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 



Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Gly Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arq Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 
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Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala Ser His Pro Leu 
675 680 685 

Arg Gly Gly Pro Gly Leu His 
690 695 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 io 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Glv Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
105 HO 
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Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 
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Lys 


Leu 


Ser 


Trp 


Asp 
245 


Leu 


Ala 


Lys 


Val 


Arg 
250 


Thr 


Asp 


Leu 


Pro 


Leu 
255 


Glu 


Val 


Asp 


Phe 


Ala 
2 6 0 


Lys 


Arg 


Arg 


Glu 


Pro 
2 65 


Asp 


Arg 


Glu 


Arg 


Leu 
270 


Arg 


Ala 


Phe 


Leu 


Glu 
275 


Arg 


Leu 


Glu 


Phe 


Gly 
280 


Ser 


Leu 


Leu 


His 


Glu 
285 


Phe 


Gly 


Leu 


Leu 


Glu 
290 


Ser 


Pro 


Lys 


Ser 


Trp 
295 


Arg 


Gly 


Cys 


He 


Pro 
300 


Trp 


Pro 


Cys 


Pro 


Trp 
305 


Arg 


Trp 


Arg 


Trp 


Gly 
310 























(2) INFORMATION FOR SEQ ID NO:89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg He Asn Ser Gly 
1 5 10 15 

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly 
20 25 30 

His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu Thr Thr 
35 40 45 

Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys Ser Leu 
50 55 60 

Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He Val Val Phe Asp 
65 70 75 80 

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr Lys Ala 

90 95 



85 

Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala Leu He 
100 105 HO 

Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val Pro Gly 
115 120 125 

Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys Ala Glu Lys 
130 135 140 

Glu Glv Tyr Glu Val Arg He Leu Thr Ala Asp Lys Asp Leu Tyr Gin 
145 150 155 160 

Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly Tyr Leu He 
165 170 175 

Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp Gin Trp 
180 185 190 
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Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu Pro Gly 
195 200 205 

Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp 
210 215 220 

Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala 
225 230 235 240 

He Arg Glu Lys He Leu Ala His Met Asp Asp Leu Lys Leu Ser Trp 
245 250 255 

Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp Phe Ala 
260 265 270 

Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe Leu Glu Arg 
275 280 285 

Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro 
290 295 300 

Lys Ser Trp Arg Gly Cys He Pro Trp Pro Cys Pro Trp Arg Trp Arg 
305 310 315 320 

Trp Gly 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: 

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly 
15 10 15 

His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu Thr Thr 
20 25 30 

Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys Ser Leu 
35 40 45 

Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He Val Val Phe Asp 
50 55 60 

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr Lys Ala 
65 70 75 80 

Glv Arq Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala Leu He 
85 90 95 

Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val Pro Gly 
100 105 HO 

Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys Ala Glu Lys 
115 120 125 
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Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys Asp Leu Tyr Gin 
130 135 140 



Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly Tyr Leu lie 
145 150 155 160 

Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp Gin Trp 
165 170 I 75 

Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu Pro Gly 
180 185 I 90 

Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp 
195 200 205 

Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala 
210 215 220 

He Arg Glu Lys He Leu Ala His Met Asp Asp Leu Lys Leu Ser Trp 
225 230 235 240 

Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp Phe Ala 
245 250 255 

Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe Leu Glu Arg 
260 265 270 

Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro 
275 280 285 

Lys He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala 
290 295 300 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
305 310 315 -*2U 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr 
325 330 335 

Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro 
340 345 350 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
355 360 365 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
370 375 380 

Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
385 390 395 400 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val 
405 410 415 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg 
420 425 430 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
435 440 445 

Val Arq Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
450 455 460 
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3tA 



Phe 
465 


Pro 


Arg 


Leu 


Glu 


Glu 
470 


Asp 


Glu 


Leu 


V cL± 


485 


Glu 


Arg 


Leu 


Ala 


Lys 
500 


Glu 


Val 


Leu 


Glu 


Val 
515 


Glu 


Val 


Gly 



475 



480 



490 



495 



505 



510 



520 



525 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg He Asn Ser Gly 
15 io 15 

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly 
20 25 30 

His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu Thr Thr 
35 40 45 

Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys Ser Leu 
50 55 60 

Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He Val Val Phe Asp 
65 70 75 80 

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr Lys Ala 
85 90 95 

Glv Arq Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala Leu He 
100 105 HO 

Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val Pro Gly 
115 120 125 

Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys Ala Glu Lys 
130 135 140 

Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys Asp Leu Tyr Gin 
145 150 155 ISO 

Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly Tyr Leu He 
165 170 175 

Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp Gin Trp 
180 185 190 

Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu Pro Gly 
195 200 205 
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Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp 
210 215 220 

Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala 
225 230 235 240 

He Arq Glu Lys He Leu Ala His Met Asp Asp Leu Lys Leu Ser Trp 
245 250 255 

Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp Phe Ala 
260 265 270 

Lys Arq Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe Leu Glu Arg 
275 280 285 

Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro 
290 295 300 

Lys Ala Ala Leu Glu His His His His His His 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: 
ATGGAGGAGC CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC TGAGTCAGGA AACATTTTCA 
GACCTATGGA AACTACTTCC TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 
GATGATTTGA TGCTGTCCCC GGAC GAT AT T GAACAATGGT TCACTGAAGA CCCAGGTCCA 
GATGAAGCTC CCAGAATGCC AGAGGCTGCT CCCCCCGTGG CCCCTGCACC AGCAGCTCCT 
ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT CATCTTCTGT CCCTTCCCAG 
AAAACCTACC AGGGCAGCTA CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 
TCTGTGACTT GCACGTACTC CCCTGCCCTC AACAAGATGT TTTGCCAACT GGCCAAGACC 
TGCCCTGTGC AGCTGTGGGT TGATTCCACA CCCCCGCCCG GCACCCGCGT CCGCGCCATG 
GCCATCTACA AGCAGTCACA GCACATGACG GAGGTTGTGA GGCGCTGCCC CCACCATGAG 
CGCTGCTCAG ATAGCGATGG TCTGGCCCCT CCTCAGCATC TTATCCGAGT GGAAGGAAAT 
TTGCGTGTGG AGTATTTGGA TGACAGAAAC ACTTTTCGAC ATAGTGTGGT GGTGCCCTAT 
GAGCCGCCTG AGGTTGGCTC TGACTGTACC ACCATCCACT ACAACTACAT GTGTAACAGT 
TCCTGCATGG GCGGCATGAA CCGGAGGCCC ATCCTCACCA TCATCACACT GGAAGACTCC 
AGTGGTAATC TACTGGGACG GAACAGCTTT GAGGTGCGTG TTTGTGCCTG TCCTGGGAGA 
GACCGGCGCA CAGAGGAAGA GAATCTCCGC AAGAAAGGGG AGCCTCACCA CGAGCTGCCC 
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CCAGGGAGCA CTAAGCGAGC ACTGCCCAAC AACACCAGCT CCTCTCCCCA GCCAAAGAAG 
AAACCACTGG ATGGAGAATA TTTCACCCTT CAGATCCGTG GGCGTGAGCG CTTCGAGATG 
TTCCGAGAGC TGAATGAGGC CTTGGAACTC AAGGATGCCC AGGCTGGGAA GGAGCCAGGG 
GGGAGCAGGG CTCACTCCAG CCACCTGAAG T C C AAAAAGG GTCAGTCTAC CTCCCGCCAT 
AAAAAACTCA TGTTCAAGAC AGAAGGGCCT GACTCAGACT GA 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
ATGGAGGAGC CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC TGAGTCAGGA AACATTTTCA 
GACCTATGGA AACTACTTCC TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 
GATGATTTGA TGCTGTCCCC GGACGATATT GAACAATGGT TCACTGAAGA CCCAGGTCCA 
GATGAAGCTC CCAGAATGCC AGAGGCTGCT CCCCCCGTGG CCCCTGCACC AGCAGCTCCT 
ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT CATCTTCTGT CCCTTCCCAG 
AAAACCTACC AGGGCAGCTA CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 
TCTGTGACTT GCACGTACTC CCCTGCCCTC AACAAGATGT TTTGCCAACT GGCCAAGACC 
TGCCCTGCGC AGCTGTGGGT TGATTCCACA CCCCCGCCCG GCACCCGCGT CCGCGCCATG 
GCCATCTACA AGCAGTCACA GCACATGACG GAGGTTGTGA GGCGCTGCCC CCACCATGAG 
CGCTGCTCAG ATAGCGATGG TCTGGCCCCT CCTCAGCATC TTATCCGAGT GGAAGGAAAT 
TTGCGTGTGG AGTATTTGGA TGACAGAAAC ACTTTTCGAC ATAGTGTGGT GGTGCCCTAT 
GAGCCGCCTG AGGTTGGCTC TGACTGTACC ACCATCCACT ACAACTACAT GTGTAACAGT 
TCCTGCATGG GCGGCATGAA CCGGAGGCCC ATCCTCACCA TCATCACACT GGAAGACTCC 
AGTGGTAATC TACTGGGACG GAACAGCTTT GAGGTGCGTG TTTGTGCCTG TCCTGGGAGA 
GACCGGCGCA CAGAGGAAGA GAATCTCCGC AAGAAAGGGG AGCCTCACCA CGAGCTGCCC 
CCAGGGAGCA CTAAGCGAGC ACTGCCCAAC AACACCAGCT CCTCTCCCCA GCCAAAGAAG 
AAACCACTGG ATGGAGAATA TTTCACCCTT CAGATCCGTG GGCGTGAGCG CTTCGAGATG 
TTCCGAGAGC TGAATGAGGC CTTGGAACTC AAGGATGCCC AGGCTGGGAA GGAGCCAGGG 
GGGAGCAGGG CTCACTCCAG CCACCTGAAG TCCAAAAAGG GTCAGTCTAC CTCCCGCCAT 
AAAAAACTCA TGTTCAAGAC AGAAGGGCCT GACTCAGACT GA 
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(2) INFORMATION FOR SEQ ID NO : 94 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 



ATGGAGGAGC CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC TGAGTCAGGA AACATTTTCA 
GACCTATGGA AACTACTTCC TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 
GATGATTTGA TGCTGTCCCC GGACGATATT GAACAATGGT TCACTGAAGA CCCAGGTCCA 
GATGAAGCTC CCAGAATGCC AGAGGCTGCT CCCCCCGTGG CCCCTGCACC AGCAGCTCCT 
ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT CATCTTCTGT CCCTTCCCAG 
AAAACCTACC AGGGCAGCTA CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 
TCTGTGACTT GCACGTACTC CCCTGCCCTC AACAAGATGT TTTGCCAACT GGCCAAGACC 
TGCCCTGTGC AGCTGTGGGT TGATTCCACA CCCCCGCCCG GCACCCGCGT CCGCGCCATG 
GCCATCTACA AGCAGTCACA GCACATGACG GAGGTTGTGA GGCGCTGCCC CCACCATGAG 
CGCTGCTCAG ATAGCGATGG TCTGGCCCCT CCTCAGCATC TTATCCGAGT GGAAGGAAAT 
TTGCGTGTGG AGTATTTGGA TGACAGAAAC ACTTTTCGAC ATAGTGTGGT GGTGCCCTAT 
GAGCCGCCTG AGGTTGGCTC TGACTGTACC ACCATCCACT ACAACTACAT GTGTAACAGT 
TCCTGCATGG GCGGCATGAA CCGGAGACCC ATCCTCACCA TCATCACACT GGAAGACTCC 
AGTGGTAATC TACTGGGACG GAACAGCTTT GAGGTGCGTG TTTGTGCCTG TCCTGGGAGA 
GACCGGCGCA CAGAGGAAGA GAATCTCCGC AAGAAAGGGG AGCCTCACCA CGAGCTGCCC 
CCAGGGAGCA CTAAGCGAGC ACTGCCCAAC AACACCAGCT CCTCTCCCCA GCCAAAGAAG 
AAACCACTGG ATGGAGAATA TTTCACCCTT CAGATCCGTG GGCGTGAGCG CTTCGAGATG 
TTCCGAGAGC TGAATGAGGC CTTGGAACTC AAGGATGCCC AGGCTGGGAA GGAGCCAGGG 
GGGAGCAGGG CTCACTCCAG CCACCTGAAG TCCAAAAAGG GTCAGTCTAC CTCCCGCCAT 
AAAAAACTCA TGTTCAAGAC AGAAGGGCCT GACTCAGACT GA 
(2) INFORMATION FOR SEQ ID NO: 95: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
TCTGGGCTTC TTGCATTCTG 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: 

GTTGGGCAGT GCTCGCTTAG 

(2) INFORMATION FOR SEQ ID NO:97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 



TCTGGGCTTC 


TTGCATTCTG 


GGACAGCCAA 


GTCTGTGACT 


TGC AC GT ACT 


CCCCTGCCCT 


60 


CAACAAGATG 


TTTTGCCAAC 


TGGCCAAGAC 


CTGCCCTGTG 


CAGCTGTGGG 


TTGATTCCAC 


120 


ACCCCCGCCC 


GGCACCCGCG 


TCCGCGCCAT 


GGCCATCTAC 


AAGCAGTCAC 


AGCACATGAC 


180 


GGAGGTTGTG 


AGGCGCTGCC 


CCCACCATGA 


GCGCTGCTCA 


GATAGCGATG 


GTCTGGCCCC 


240 


TCCTCAGCAT 


CTTATCCGAG 


TGGAAGGAAA 


TTTGCGTGTG 


GAGTATTTGG 


ATGACAGAAA 


300 


CACTTTTCGA 


CATAGTGTGG 


TGGTGCCCTA 


TGAGCCGCCT 


GAGGTTGGCT 


CTGACTGTAC 


360 


CACCATCCAC 


TACAACTACA 


TGTGTAACAG 


TTCCTGCATG 


GGCGGCATGA 


ACCGGAGGCC 


420 


CATCCTCACC 


ATCATCACAC 


TGGAAGACTC 


CAGTGGTAAT 


CTACTGGGAC 


GGAACAGCTT 


480 


TGAGGTGCGT 


GTTTGTGCCT 


GTC CTGGGAG 


AGAC CGGCGC 


ACAGAGGAAG 


AGAATCTCCG 


540 


CAAGAAAGGG 


GAGCCTCACC 


ACGAGCTGCC 


CCCAGGGAGC 


ACTAAGCGAG 


CACTGCCCAA 


600 


C 












601 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 



GTTGGGCAGT 


GCTCGCTTAG 


TGCTCCCTGG 


/~1 /"I f~t f\ 7\ /""I Z™ 1 

GGGC AGC_ I 


X UvJ J- UriOvjL. J. 


CCCCTTTCTT 


60 


GCGGAGATTC 


TCTTCCTCTG 


TGCGCCGGTC 


m »-i rn /"i /™i /"i t\ /~i /"i 7\ 

TCTCLLACjCjA 


n a pr r* a r* zi z\ zx 


CACGCACCTC 


120 


AAAGCTGTTC 


CGTCCCAGTA 


GATTACCACT 


GGAGTC 1 1 LC 


J.U1 un X un 


TGGTGAGGAT 


180 


GGGCCTCCGG 


TTCATGCCGC 


CCATGCAGGA 


ACTGTTACAC 




AOTGGATGGT 


240 


GGTACAGTCA 


GAGCCAACCT 


CAGGCGGCTC 


ATAGGGCACC 




GTCGAAAAGT 


300 


GTTTCTGTCA 


TCCAAATACT 


CCACACGCAA 


ATTTCCTTCC 


ACTCGGATAA 


GATGCTGAGG 


360 


AGGGGCCAGA 


CCATCGCTAT 


CTGAGCAGCG 


CTCATGGTGG 


GGGCAGCGCC 


TCACAACCTC 


420 


CGTCATGTGC 


TGTGACTGCT 


TGTAGATGGC 


CATGGCGCGG 


ACGCGGGTGC 


CGGGCGGGGG 


480 


TGTGGAAT C A 


ACCCACAGCT 


GCACAGGGCA 


GGTCTTGGCC 


AGTTGGCAAA 


ACATCTTGTT 


540 


GAGGGCAGGG 


GAGTACGTGC 


AAGT C AC AG A 


CTTGGCTGTC 


C C AGAATGC A 


AGAAGC C C AG 


600 
601 



A 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



TCTGGGCTTC 


TTGCATTCTG 


GG AC AG C C AA 


GTCTGTGACT 


TGCACGTACT 


CCCCTGCCCT 


60 


CAACAAGATG 


TTTTGCCAAC 


TGGCCAAGAC 


CTGCCCTGCG 


CAGCTGTGGG 


TTGATTCCAC 


120 


ACCCCCGCCC 


GGCACCCGCG 


TCCGCGCCAT 


GGCCATCTAC 


AAGCAGTCAC 


AGCACATGAC 


180 


GGAGGTTGTG 


AGGCGCTGCC 


CCCACCATGA 


GCGCTGCTCA 


GATAGCGATG 


GTCTGGCCCC 


240 


TCCTCAGCAT 


CTTATCCGAG 


TGGAAGGAAA 


TTTGCGTGTG 


GAGTATTTGG 


ATGACAGAAA 


300 


CACTTTTCGA 


CATAGTGTGG 


TGGTGCCCTA 


TGAGCCGCCT 


GAGGTTGGCT 


CTGACTGTAC 


360 


CACCATCCAC 


TACAACTACA 


TGTGTAACAG 


TTCCTGCATG 


GGCGGCATGA 


ACCGGAGGCC 


420 


CATCCTCACC 


ATCATCACAC 


TGGAAGACTC 


CAGTGGTAAT 


CTACTGGGAC 


GGAACAGCTT 


480 


TGAGGTGCGT 


GTTTGTGCCT 


GTCCTGGGAG 


AGACCGGCGC 


ACAGAGGAAG 


AGAATCTCCG 


540 


CAAGAAAGGG 


GAGCCTCACC 


ACGAGCTGCC 


CCCAGGGAGC 


ACTAAGCGAG 


CACTGCCCAA 


600 
601 



c 

(2) INFORMATION FOR SEQ ID NO:100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100: 



GTTGGGCAGT 


bL I L.CjL. 1 1 ALj 








CCCCTTTCTT 


60 


GCGGACjAi I(_ 


rp/~i rT"i rj-i rp /"""P/""' 

IC 1 ILL. ± L. IIjj 




rp p rp f-i /-i (-1 7\ ri r* A 




PACGCACCTC 


120 


AAAGCTGTTC 


CGTCCCAGTA 


GATTACCACT 


GGAGTCTTCC 


AGTGTGATGA 


TGGTGAGGAT 


180 


GGGCCTCCGG 


TTCATGCCGC 


CCATGCAGGA 


ACTGTTACAC 


ATGTAGTTGT 


AGTGGATGGT 


240 


GGTACAGTCA 


GAGCCAACCT 


CAGGCGGCTC 


ATAGGGCACC 


ACCACACTAT 


GTCGAAAAGT 


300 


GTTTCTGTCA 


TCCAAATACT 


CCACACGCAA 


ATTTCCTTCC 


ACTCGGATAA 


GATGCTGAGG 


360 


AGGGGCCAGA 


CCATCGCTAT 


CTGAGCAGCG 


CTCATGGTGG 


GGGCAGCGCC 


TCACAACCTC 


420 


CGTCATGTGC 


TGTGACTGCT 


TGTAGATGGC 


CATGGCGCGG 


ACGCGGGTGC 


CGGGCGGGGG 


480 


TGTGGAATCA 


ACCCACAGCT 


GCGCAGGGCA 


GGTCTTGGCC 


AGTTGGCAAA 


ACATCTTGTT 


540 


GAGGGCAGGG 


GAGTACGTGC 


AAGTCACAGA 


CTTGGCTGTC 


C C AG AATGC A 


AGAAGC C C AG 


600 



601 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 101: 



TCTGGGCTTC 


TTGCATTCTG 


GGACAGCCAA 


GTCTGTGACT 


TGCACGTACT 


CCCCTGCCCT 


60 


CAACAAGATG 


TTTTGCCAAC 


TGGCCAAGAC 


CTGCCCTGTG 


CAGCTGTGGG 


TTGATTCCAC 


120 


ACCCCCGCCC 


GGCACCCGCG 


TCCGCGCCAT 


GGCCATCTAC 


AAGCAGTCAC 


AGCACATGAC 


180 


GGAGGTTGTG 


AGGCGCTGCC 


CCCACCATGA 


GCGCTGCTCA 


GATAGCGATG 


GTCTGGCCCC 


240 


TCCTCAGCAT 


CTTATCCGAG 


TGGAAGGAAA 


TTTGCGTGTG 


GAGTATTTGG 


ATGACAGAAA 


300 


CACTTTTCGA 


CATAGTGTGG 


TGGTGC CCTA 


TGAGCCGCCT 


GAGGTTGGCT 


C TG AC T GT AC 


360 


CACCATCCAC 


TACAACTACA 


TGTGTAACAG 


TTCCTGCATG 


GGCGGCATGA 


ACCGGAGACC 


420 


CATCCTCACC 


ATCATCACAC 


TGGAAGACTC 


CAGTGGTAAT 


CTACTGGGAC 


GGAACAGCTT 


480 


TGAGGTGCGT 


GTTTGTGCCT 


GTCCTGGGAG 


AGACCGGCGC 


ACAGAGGAAG 


AGAATCTCCG 


540 


CAAGAAAGGG 


GAGCCTCACC 


ACGAGCTGCC 


CCCAGGGAGC 


ACTAAGCGAG 


CACTGCCCAA 


600 
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601 



C 



(2) INFORMATION FOR SEQ ID NO:102: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE 


: DNA (genomic) 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 2 






GTTGGGCAGT 


GCTCGCTTAG 


TGCTCCCTGG 


GGGCAGCTCG 


TGGTGAGGCT 


CLCL. 1 1 1L1 i 


GCGGAGATTC 


TCTTCCTCTG 


TGCGCCGGTC 


TCTCCCAGGA 


CAGGCACAAA 


CACGCACCTC 


AAAGCTGTTC 


CGTCCCAGTA 


GATTACCACT 


GGAGTCTTCC 


AGTGTGATGA 


TGGTGAGGAT 


GGGTCTCCGG 


TTCATGCCGC 


CCATGCAGGA 


ACTGTTACAC 


ATGTAGTTGT 


AGTGGATGGT 


GGTACAGTCA 


GAGCCAACCT 


CAGGCGGCTC 


ATAGGGCACC 


ACCACACTAT 


GTCGAAAAGT 


GTTTCTGTCA 


TCCAAATACT 


CCACACGCAA 


ATTTCCTTCC 


ACTCGGATAA 


GATGC TGAGG 


AGGGGCCAGA 


CCATCGCTAT 


CTGAGCAGCG 


CTCATGGTGG 


GGGCAGCGCC 


TCACAACCTC 


CGTCATGTGC 


TGTGACTGCT 


TGTAGATGGC 


CATGGCGCGG 


ACGCGGGTGC 


CGGGCGGGGG 


TGTGGAATCA 


ACCCACAGCT 


GCACAGGGCA 


GGTCTTGGCC 


AGTTGGCAAA 


ACATCTTGTT 


GAGGGCAGGG 


GAGTACGTGC 


AAGTCACAGA 


CTTGGCTGTC 


CCAGAATGCA 


AGAAGCCCAG 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
GAGGATGGGA CTCCGGTTCA TG 
(2) INFORMATION FOR SEQ ID NO:104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 



60 
120 
180 
240 
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A 
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CATGAACCGG AGTCCCATCC TCAC 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GCACAAACAT GCACCTCAAA GCT 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
CAGCTTTGAG GTGCATGTTT GT 
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 
CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 
ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 
GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 
TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 
CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 
CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGTCC 
CATCCTCACC AT CAT C AC AC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 
TGAGGTGCGT GTTTGTGCCT GTCCTGGGAG AGACCGGCGC ACAGAGGAAG AGAATCTCCG 
CAAGAAAGGG GAGCCTCACC ACGAGCTGCC CCCAGGGAGC ACTAAGCGAG CACTGCCCAA 
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c 



601 



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 8: 



GTTGGGCAGT 


GCTCGCTTAG 


TGCTC CCTGG 


GGGCAGCTCG 


TGGTGAGGCT 


CCCCTTTCTT 


60 


GCGGAGATTC 


TCTTCCTCTG 


TGCGCCGGTC 


TCTCCCAGGA 


C AGG C AC AAA 


CACGCACCTC 


120 


AAAGCTGTTC 


CGTCCCAGTA 


GATTACCACT 


GGAGTCTTCC 


AGTGTGATGA 


TGGTGAGGAT 


180 


GGGACTCCGG 


TTCATGCCGC 


CCATGCAGGA 


ACTGTTACAC 


ATGTAGTTGT 


AGTGGATGGT 


240 


GGTACAGTCA 


GAGCCAACCT 


CAGGCGGCTC 


ATAGGGC AC C 


ACCACACTAT 


GTCGAAAAGT 


300 


GTTTCTGTCA 


TCCAAATACT 


CCACACGCAA 


ATTTCCTTCC 


ACTCGGATAA 


GATGCTGAGG 


360 


AGGGG C C AG A 


CCATCGCTAT 


CTGAGCAGCG 


CTCATGGTGG 


GGGCAGCGCC 


TCACAACCTC 


420 


CGTCATGTGC 


TGTGACTGCT 


TGTAGATGGC 


CATGGCGCGG 


ACGCGGGTGC 


CGGGCGGGGG 


480 


TGTGGAATCA 


ACCCACAGCT 


GCACAGGGCA 


GGTCTTGGCC 


AGTTGGCAAA 


ACATCTTGTT 


540 


GAGGGCAGGG 


GAGTACGTGC 


AAGTCACAGA 


CTTGGCTGTC 


CCAGAATGCA 


AGAAGC C C AG 


600 


A 












601 



(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 60 

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 12 0 

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 18 0 

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 24 0 

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300 

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 36 0 

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGGCC 42 0 
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CATCCTCACC ATCATCACAC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 
TGAGGTGCAT GTTTGTGCCT GTCCTGGGAG AGACCGGCGC ACAGAGGAAG AGAATCTCCG 
CAAGAAAGGG GAGCCTCACC ACGAGCTGCC CCCAGGGAGC ACTAAGCGAG CACTGCCCAA 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
GTTGGGCAGT GCTCGCTTAG TGCTCCCTGG GGGCAGCTCG TGGTGAGGCT CCCCTTTCTT 
GCGGAGATTC TCTTCCTCTG TGCGCCGGTC TCTCCCAGGA CAGGCACAAA CATGCACCTC 
AAAGCTGTTC CGTCCCAGTA GATTACCACT GGAGTCTTCC AGTGTGATGA TGGTGAGGAT 
GGGCCTCCGG TTCATGCCGC CCATGCAGGA ACTGTTACAC ATGTAGTTGT AGTGGATGGT 
GGTACAGTCA GAGCCAACCT CAGGCGGCTC ATAGGGC AC C ACCACACTAT GTCGAAAAGT 
GTTTCTGTCA TCCAAATACT CCACACGCAA ATTTCCTTCC ACTCGGATAA GATGCTGAGG 
AGGGGCCAGA CCATCGCTAT CTGAGCAGCG CTCATGGTGG GGGCAGCGCC TCACAACCTC 
CGTCATGTGC TGTGACTGCT TGTAGATGGC CATGGCGCGG ACGCGGGTGC CGGGCGGGGG 
TGTGGAATCA ACCCACAGCT GCACAGGGCA GGTCTTGGCC AGTTGGCAAA AC AT C TTGTT 
GAGGGCAGGG GAGTACGTGC AAGTCACAGA CTTGGCTGTC CCAGAATGCA AGAAGCCCAG 
A 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 
CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 
ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 
GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 



480 
540 
600 
601 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 base pairs 



TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 300 
CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 360 
CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGTCC 42 0 

427 

CATCCTC 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
CATGAACCGG AGTCCCATCC TCACCATCAT CACACTGGAA GACTCCAGTG GTAATCTACT 60 
GGGAC GGAAC AGCTTTGAGG TGCGTGTTTG TGCCTGTCCT GGGAGAGACC GGCGCACAGA 12 0 

GGAAGAGAAT CTCCGCAAGA AAGGGGAGCC TCACCACGAG CTGCCCCCAG GGAGCACTAA 18 0 

GCGAGCACTG CCCAAC 196 
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

TCTGGGCTTC TTGCATTCTG GGACAGCCAA GTCTGTGACT TGCACGTACT CCCCTGCCCT 6 0 

CAACAAGATG TTTTGCCAAC TGGCCAAGAC CTGCCCTGTG CAGCTGTGGG TTGATTCCAC 12 0 

ACCCCCGCCC GGCACCCGCG TCCGCGCCAT GGCCATCTAC AAGCAGTCAC AGCACATGAC 18 0 

GGAGGTTGTG AGGCGCTGCC CCCACCATGA GCGCTGCTCA GATAGCGATG GTCTGGCCCC 24 0 

TCCTCAGCAT CTTATCCGAG TGGAAGGAAA TTTGCGTGTG GAGTATTTGG ATGACAGAAA 3 00 

CACTTTTCGA CATAGTGTGG TGGTGCCCTA TGAGCCGCCT GAGGTTGGCT CTGACTGTAC 3 60 

CACCATCCAC TACAACTACA TGTGTAACAG TTCCTGCATG GGCGGCATGA ACCGGAGGCC 42 0 

CATCCTCACC AT CAT C AC AC TGGAAGACTC CAGTGGTAAT CTACTGGGAC GGAACAGCTT 480 
TGAGGTGCAT GTTTGTGC 

(2) INFORMATION FOR SEQ ID NO: 114: 



498 



375 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
CAGCTTTGAG GTGCATGTTT GTGCCTGTCC TGGGAGAGAC CGGCGCACAG AGGAAGAGAA 
TCTCCGCAAG AAAGGGGAGC CTCACCACGA GCTGCCCCCA GGGAGCACTA AGCGAGCACT 
GCCCAAC 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GGTTTTTCTT TGAGGTTTAG 
(2) INFORMATION FOR SEQ ID NO : 116 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
GCGACACTCC ACCATAGAT 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
CTGTCTTCAC GCAGAAAGC 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
GCACGGTCTA CGAGACCTC 
(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
GATCTACTAG TCATATGGAT 
(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
TCGGTACCCG GGGATCCGAT 
(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

CTGTCTTCAC GCAGAAAGCG TCTGGCCATG GCGTTAGTAT GAGTGTCGTG CAGCCTCCAG 

GACCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATTGC 

CAGGACGACC GGGTCCTTTC TTGGATAAAC CCGCTCAATG CCTGGAGATT TGGGCGTGCC 

CCCGCAAGAC TGCTAGCCGA GTAGTGTTGG GTCGCGAAAG GCCTTGTGGT ACTGCCTGAT 

AGGGTGCCTG CGAGTGCCCC GGGAGGTCTC GTAGACCGTG C 

(2) INFORMATION FOR SEQ ID NO:122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
CTGTCTTCAC GCAGAAAGCG TCTGGCCATG GCGTTAGTAT GAGTGTCGTG CAGCCTCCAG 
GACCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGACTGTC TTCACGCAGA 
AAGCGTCTAG CCATGGCGTT AGTATGAGTG TCGTGCAGCC TCCAGGACCC CCCCTCCCGG 
G AGAG C CAT A GTGGTCTGCG GAACCGGTGA GTACACCGGA ATTGCCAGGA CGACCGGGTC 
CTTTCTTGGA TCAACCCGCT CAATGCCTGG AGATTTGGGC GTGCCCCCGC AAGACTGCTA 
GCCGAGTAGT GTTGGGTCGC GAAAGGCCTT GTGGTACTGC CTGATAGGGT GCTTGCGAGT 
GCCCCGGGAG GTCTCGTAGA CCGTGC 
(2) INFORMATION FOR SEQ ID NO: 12 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

CTGTCTTCAC GCAGAAAGCG TCTAGCCATG GCGTTAGTAT GAGTGTCGTG CAGCCTCCAG 

GTCCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATTGC 

CAGGACGACC GGGTCCTTTC TTGGATCAAC CCGCTCAATG CCTGGAGATT TGGGCGTGCC 

CCCGCGAGAC TGCTAGCCGA GTAGTGTTGG GTCGCGAAAG GCCTTGTGGT ACTGCCTGAT 

AGGGTGCTTG CGAGTGCCCC GGGAGGTCTC GTAGACCGTG C 
(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
CTGTCTTCAC GCAGAAAGCG TCTAGCCATG GCGTTAGTAT GAGTGTCGTG CAGCCTCCAG 
GACCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATTGC 
CAGGACGACC GGGTCCTTTC GTGGATGTAA CCCGCTCAAT GCCTGGAGAT TTGGGCGTGC 
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CCCCGCAAGA CTGCTAGCCG AGTAGTGTTG GGTCGCGAAA GGCCTTGTGG TACTGCCTGA 24 0 

TAGGGTGCTT GCGAGTGCCC CGGGAGGTCT CGTAGACCGT GC 2 82 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 
CTGTCTTCAC GCAGAAAGCG TCTAGCCATG GCGTTAGTAT GAGTGTCGTA CAGCCTCCAG 6 0 

GCCCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATTGC 12 0 

CGGGAAGACT GGGTCCTTTC TTGGATAAAC CCACTCTATG CCCGGCCATT TGGGCGTGCC 18 0 

CCCGCAAGAC TGCTAGCCGA GTAGCGTTGG GTTGCGAAAG GCCTTGTGGT ACTGCCTGAT 24 0 

AGGGTGCTTG CGAGTACCCC GGGAGGTCTC GTAGACCGTG C 2 81 

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 6: 

CTGTCTTCAC GCAGAAAGCG CCTAGCCATG GCGTTAGTAC GAGTGTCGTG CAGCCTCCAG 6 0 

GACCCCCCCT CCCGGGAGAG CCATAGTGGT CTGCGGAACC GGTGAGTACA CCGGAATCGC 12 0 

TGGGGTGACC GGGTCCTTTC TTGGAGCAAC CCGCTCAATA CCCAGAAATT TGGGCGTGCC 18 0 

CCCGCGAGAT CACTAGCCGA GTAGTGTTGG GTCGCGAAAG GCCTTGTGGT ACTGCCTGAT 24 0 

AGGGTGCTTG CGAGTGCCCC GGGAGGTCTC GTAGACCGTG C 281 
(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAGGCACCC TATCAGGCAG TACCACAAGG 60 
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CCTTTCGCGA CCCAACACTA CTCGGCTAGC AGTCTTGCGG GGGCACGCCC AAATCTCCAG 



120 



GCATTGAGCG GGTTTATCCA AGAAAGGACC CGGTCGTCCT GGCAATTCCG GTGTACTCAC 



180 



CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGT CCTGGAGGCT GCACGACACT 



240 



CATACTAACG CCATGGCCAG ACGCTTTCTG CGTGAAGACA G 



281 



(2) INFORMATION FOR SEQ ID NO: 12 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 
GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 6 0 

CCTTTCGCGA CCCAACACTA CTCGGCTAGC AGTCTTGCGG GGGCACGCCC AAATCTCCAG 12 0 

GCATTGAGCG GGTTGATCCA AGAAAGGACC CGGTCGTCCT GGCAATTCCG GTGTACTCAC 18 0 

CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGT CCTGGAGGCT GCACGACACT 24 0 

CATACTAACG CCATGGCTAG ACGCTTTCTG CGTGAAGACA G 281 
(2) INFORMATION FOR SEQ ID NO: 12 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 9: 

GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60 

CCTTTCGCGA CCCAACACTA CTCGGCTAGC AGTCTCGCGG GGGCACGCCC AAATCTCCAG 12 0 

GCATTGAGCG GGTTGATCCA AGAAAGGACC CGGTCGTCCT GGCAATTCCG GTGTACTCAC 180 

CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGA CCTGGAGGCT GCACGACACT 24 0 

CATACTAACG CCATGGCTAG ACGCTTTCTG CGTGAAGACA G 2 81 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60 

CCTTTCGCGA CCCAACACTA CTCGGCTAGC AGTCTTGCGG GGGCACGCCC AAATCTCCAG 12 0 

GCATTGAGCG GGTTACATCC ACGAAAGGAC CCGGTCGTCC TGGCAATTCC GGTGTACTCA 180 

CCGGTTCCGC AGACCACTAT GGCTCTCCCG GGAGGGGGGG TCCTGGAGGC TGCACGACAC 24 0 

TCATACTAAC GCCATGGCTA GACGCTTTCT GCGTGAAGAC AG 2 82 
(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
GCACGGTCTA CGAGACCTCC CGGGGTACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60 
CCTTTCGCAA CCCAACGCTA CTCGGCTAGC AGTCTTGCGG GGGCACGCCC AAATGGCCGG 120 
GCATAGAGTG GGTTTATC C A AG AAAGGAC C CAGTCTTCCC GGCAATTCCG GTGTACTCAC 180 
CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGG CCTGGAGGCT GTACGACACT 240 
CATACTAACG CCATGGCTAG ACGCTTTCTG CGTGAAGACA G 2 81 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
GCACGGTCTA CGAGACCTCC CGGGGCACTC GCAAGCACCC TATCAGGCAG TACCACAAGG 60 
CCTTTCGCGA CCCAACACTA CTCGGCTAGT GATCTCGCGG GGGCACGCCC AAATTTCTGG 12 0 

GTATTGAGCG GGTTGCTCCA AGAAAGGACC CGGTCACCCC AGCGATTCCG GTGTACTCAC 180 
CGGTTCCGCA GACCACTATG GCTCTCCCGG GAGGGGGGGT CCTGGAGGCT GCACGACACT 24 0 

CGTACTAACG CCATGGCTAG GCGCTTTCTG CGTGAAGACA G 2 81 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

- 381 - 






(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 



ATCAACATCC GGCCGGTGGT 



20 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
GGGGCCTCGC TACGGACCAG 2 0 

(2) INFORMATION FOR SEQ ID NO: 13 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

ATCAACATCC GGCCGGTGGT CGCCGCGATC AAGGAGTTCT TCGGCACCAG CCAGCTGAGC 60 

CAATTCATGG ACCAGAACAA CCCGCTGTCG GGGTTGACCC ACAAGCGCCG ACTGTCGGCG 12 0 

CTGGGGCCCG GCGGTCTGTC ACGTGAGCGT GCCGGGCTGG AGGTCCGCGA CGTGCACCCG 18 0 

TCGCACTACG GCCGGATGTG CCCGATCGAA ACCCCTGAGG GGCCCAACAT CGGTCTGATC 24 0 

GGCTCGCTGT CGGTGTACGC GCGGGTCAAC CCGTTCGGGT TCATCGAAAC GCCGTACCGC 300 

AAGGTGGTCG ACGGCGTGGT TAGCGACGAG ATCGTGTACC TGACCGCCGA CGAGGAGGAC 360 

CGCCACGTGG TGGCACAGGC CAATTCGCCG ATCGATGCGG ACGGTCGCTT CGTCGAGCCG 42 0 

CGCGTGCTGG TCCGCCGCAA GGCGGGCGAG GTGGAGTACG TGCCCTCGTC TGAGGTGGAC 48 0 

TACATGGACG TCTCGCCCCG CCAGATGGTG TCGGTGGCCA CCGCGATGAT TCCCTTCCTG 54 0 

GAGCACGACG ACGCCAACCG TGCCCTCATG GGGGCAAACA TGCAGCGCCA GGCGGTGCCG 60 0 

CTGGTCCGTA GCGAGGCCCC 62 0 
(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 6: 

ATCAACATCC GGCCGGTGGT CGCCGCGATC AAGGAGTTCT TCGGCACCAG CCAGCTGAGC 60 

CAATTCATGG ACCAGAACAA CCCGCTGTCG GGGTTGACCT ACAAGCGCCG AC TGTCGGCG 12 0 

CTGGGGCCCG GCGGTCTGTC ACGTGAGCGT GCCGGGCTGG AGGTCCGCGA CGTGCACCCG 180 

TCGCACTACG GCCGGATGTG CCCGATCGAA ACCCCTGAGG GGCCCAACAT CGGTCTGATC 240 

GGCTCGCTGT CGGTGTACGC GCGGGTCAAC CCGTTCGGGT TCATCGAAAC GCCGTACCGC 300 

AAGGTGGTCG ACGGCGTGGT TAGCGACGAG ATCGTGTACC TGACCGCCGA CGAGGAGGAC 360 

CGCCACGTGG TGGCACAGGC CAATTCGCCG ATCGATGCGG ACGGTCGCTT CGTCGAGCCG 42 0 

CGCGTGCTGG TCCGCCGCAA GGCGGGCGAG GTGGAGTACG TGCCCTCGTC TGAGGTGGAC 4 80 

TACATGGACG TCTCGCCCCG CCAGATGGTG TCGGTGGCCA CCGCGATGAT TCCCTTCCTG 54 0 

GAGCACGACG ACGCCAACCG TGCCCTCATG GGGGCAAACA TGCAGCGCCA GGCGGTGCCG 600 

CTGGTCCGTA GCGAGGCCCC 620 
(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 7: 

ATCAACATCC GGCCGGTGGT CGCCGCGATC AAGGAGTTCT TCGGCACCAG CCAGCTGAGC 60 

CAATTCATGG ACCAGAACAA CCCGCTGTCG GGGTTGACCC ACAAGCGCCG ACTGTTGGCG 12 0 

CTGGGGCCCG GCGGTCTGTC ACGTGAGCGT GCCGGGCTGG AGGTCCGCGA CGTGCACCCG 180 

TCGCACTACG GCCGGATGTG CCCGATCGAA ACCCCTGAGG GGCCCAACAT CGGTCTGATC 24 0 

GGCTCGCTGT CGGTGTACGC GCGGGTCAAC CCGTTCGGGT TCATCGAAAC GCCGTACCGC 3 00 

AAGGTGGTCG ACGGCGTGGT TAGCGACGAG ATCGTGTACC TGACCGCCGA CGAGGAGGAC 3 60 

CGCCACGTGG TGGCACAGGC CAATTCGCCG ATCGATGCGG ACGGTCGCTT CGTCGAGCCG 420 

CGCGTGCTGG TCCGCCGCAA GGCGGGCGAG GTGGAGTACG TGCCCTCGTC TGAGGTGGAC 4 80 

TACATGGACG TCTCGCCCCG CCAGATGGTG TCGGTGGCCA CCGCGATGAT TCCCTTCCTG 540 

GAGCACGACG ACGCCAACCG TGCCCTCATG GGGGCAAACA TGCAGCGCCA GGCGGTGCCG 6 00 

CTGGTCCGTA GCGAGGCCCC 62 0 
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(2) INFORMATION FOR SEQ ID NO: 138: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 138: 



GGGGCCTCGC 


TACGGACCAG 


CGGCACCGCC 


TGGCGCTGCA 


TGTTTGCCCC 


CATGAGGGCA 


60 


CGGTTGGCGT 


CGTCGTGCTC 


CAGGAAGGGA 


ATCATCGCGG 


TGGCCACCGA 


CACCATCTGG 


120 


CGGGGCGAGA 


CGTCCATGTA 


GTCCACCTCA 


GACGAGGGCA 


CGTACTCCAC 


CTCGCCCGCC 


180 


TTGCGGCGGA 


CCAGCACGCG 


CGGCTCGACG 


AAGCGACCGT 


CCGCATCGAT 


CGGCGAATTG 


240 


GCCTGTGCCA 


CCACGTGGCG 


GTCCTCCTCG 


TCGGCGGTCA 


GGTACACGAT 


CTCGTCGCTA 


300 


ACCACGCCGT 


CGACCACCTT 


GCGGTAC GGC 


GTTTCGATGA 


ACCCGAACGG 


GTTGACCCGC 


360 


GCGTACACCG 


ACAGCGAGCC 


GATCAGACCG 


ATGTTGGGCC 


CCTCAGGGGT 


TTCGATCGGG 


420 


CACATCCGGC 


CGTAGTGCGA 


CGGGTGCACG 


TCGCGGACCT 


CCAGCCCGGC 


ACGCTCACGT 


480 


GACAGACCGC 


CGGGCCCCAG 


CGCCGACAGT 


CGGCGCTTGT 


GGGTCAACCC 


CGACAGCGGG 


540 


TTGTT CTGGT 


CCATGAATTG 


GCTCAGCTGG 


CTGGTGCCGA 


AGAACTCCTT 


GATCGCGGCG 


600 


ACCACCGGCC 


GGATGTTGAT 
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(2) INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 9: 

GGGGCCTCGC TACGGACCAG CGGCACCGCC TGGCGCTGCA TGTTTGCCCC CATGAGGGCA 60 

CGGTTGGCGT CGTCGTGCTC CAGGAAGGGA ATCATCGCGG TGGCCACCGA CACCATCTGG 12 0 

CGGGGCGAGA CGTCCATGTA GTCCACCTCA GACGAGGGCA CGTACTCCAC CTCGCCCGCC 18 0 

TTGCGGCGGA CCAGCACGCG CGGCTCGACG AAGCGACCGT CCGCATCGAT CGGCGAATTG 24 0 

GCCTGTGCCA CCACGTGGCG GTCCTCCTCG TCGGCGGTCA GGTACACGAT CTCGTCGCTA 30 0 

ACCACGCCGT CGACCACCTT GCGGTACGGC GTTTCGATGA ACCCGAACGG GTTGACCCGC 360 

GCGTACACCG ACAGCGAGCC GATCAGACCG ATGTTGGGCC CCTCAGGGGT TTCGATCGGG 42 0 

CACATCCGGC CGTAGTGCGA CGGGTGCACG TCGCGGACCT CCAGCCCGGC ACGCTCACGT 480 
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GACAGACCGC CGGGCCCCAG CGCCGACAGT CGGCGCTTGT AGGTCAACCC CGACAGCGGG 540 

TTGTTCTGGT CCATGAATTG GCTCAGCTGG CTGGTGCCGA AGAACTCCTT GATCGCGGCG 600 

ACCACCGGCC GGATGTTGAT 62 0 
(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 0: 



GGGGCCTCGC 


TACGGACCAG 


CGGCACCGCC 


TGGCGCTGCA 


TGTTTGCCCC 


CATGAGGGCA 


60 


CGGTTGGCGT 


CGTCGTGCTC 


CAGGAAGGGA 


ATCATCGCGG 


TGGCCACCGA 


CACCATCTGG 


120 


CGGGGCGAGA 


CGTCCATGTA 


GTCCACCTCA 


GACGAGGGCA 


CGTACTCCAC 


CTCGCCCGCC 


180 


TTGCGGCGGA 


CCAGCACGCG 


CGGCTCGACG 


AAGCGACCGT 


CCGCATCGAT 


CGGCGAATTG 


240 


GCCTGTGCCA 


CCACGTGGCG 


GTCCTCCTCG 


TCGGCGGTCA 


GGTACACGAT 


CTCGTCGCTA 


300 


ACCACGCCGT 


CGACCACCTT 


GCGGTACGGC 


GTTTCGATGA 


ACCCGAACGG 


GTTGACCCGC 


360 


GCGTACACCG 


ACAGCGAGCC 


GATCAGACCG 


ATGTTGGGCC 


CCTCAGGGGT 


TTCGATCGGG 


420 


CACATCCGGC 


CGTAGTGCGA 


CGGGTGCACG 


TCGCGGACCT 


CCAGCCCGGC 


ACGCTCACGT 


480 


GACAGACCGC 


CGGGCCCCAG 


CGCCAACAGT 


CGGCGCTTGT 


GGGTCAACCC 


CGACAGCGGG 


540 


TTGTTCTGGT 


CCATGAATTG 


GCTCAGCTGG 


CTGGTGCCGA 


AGAACTCCTT 


GATCGCGGCG 


600 


ACCACCGGCC 


GGATGTTGAT 
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(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

AGCTCGTATG GCACCGGAAC 

(2) INFORMATION FOR SEQ ID NO: 14 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

TTGACCTCCC ACCCGACTTG 

(2) INFORMATION FOR SEQ ID NO: 14 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 3: 



AGCTCGTATG 


GCACCGGAAC 


CGGTAAGGAC 


GCGATCACCA 


GCGGCATCGA 


GGTCGTATGG 


ACGAACACCC 


CGACGAAATG 


GGACAACAGT 


TTCCTCGAGA 


TCCTGTACGG 


CTACGAGTGG 


GAGCTGACGA 


AGAGCCCTGC 


TGGCGCTTGG 


CAATACACCG 


CCAAGGACGG 


CGCCGGTGCC 


GGCACCATCC 


CGGACCCGTT 


CGGCGGGCCA 


GGGCGCTCCC 


CGACGATGCT 


GGCCACTGAC 


CTCTCGCTGC 


GGGTGGATCC 


GATCTATGAG 


CGGATCACGC 


GTCGCTGGCT 


GGAACACCCC 


GAGGAATTGG 


CCGACGAGTT 


CGCCAAGGCC 


TGGTACAAGC 


TGATCCACCG 


AGACATGGGT 


CCCGTTGCGA 


GATACCTTGG 


GCCGCTGGTC 


CCCAAGCAGA 


CCCTGCTGTG 


GCAGGATCCG 


GTCCCTGCGG 


TCAGCCACGA 


CCTCGTCGGC 


GAAGCCGAGA 


TTGCCAGCCT 


TAAGAGCCAG 


ATCCGGGCAT 


CGGGATTGAC 


TGTCTCACAG 


CTAGTTTCGA 


CCGCATGGGC 


GGCGGCGTCG 


TCGTTCCGTG 


GTAGCGACAA 


GCGCGGCGGC 


GCCAACGGTG 


GTCGCATCCG 


CCTGCAGCCA 


CAAGTCGGGT 


GGGAGGTCAA 











(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
AGCTCGTATG GCACCGGAAC CGGTAAGGAC GCGATCACCA CCGGCATCGA GGTCGTATGG 
ACGAACACCC CGACGAAATG GGACAACAGT TTCCTCGAGA TCCTGTACGG CTACGAGTGG 
GAGCTGACGA AGAGCCCTGC TGGCGCTTGG CAATACACCG CCAAGGACGG CGCCGGTGCC 
GGCACCATCC CGGACCCGTT CGGCGGGCCA GGGCGCTCCC CGACGATGCT GGCCACTGAC 
CTCTCGCTGC GGGTGGATCC GATCTATGAG CGGATCACGC GTCGCTGGCT GGAACACCCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
620 



60 
120 
180 
240 
300 
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GAGGAATTGG CCGACGAGTT CGCCAAGGCC TGGTACAAGC TGATCCACCG AGACATGGGT 3 SO 

CCCGTTGCGA GATACCTTGG GCCGCTGGTC CCCAAGCAGA CCCTGCTGTG GCAGGATCCG 420 

GTCCCTGCGG TCAGCCACGA CCTCGTCGGC GAAGC CGAGA TTGCCAGCCT TAAGAGCCAG 480 

ATCCGGGCAT CGGGATTGAC TGTCTCACAG CTAGTTTCGA CCGCATGGGC GGCGGCGTCG 54 0 

TCGTTCCGTG GTAGCGACAA GCGCGGCGGC GCCAACGGTG GTCGCATCCG CCTGCAGCCA 600 

CAAGTCGGGT GGGAGGTCAA 62 0 
(2) INFORMATION FOR SEQ ID NO : 145 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 



AGCTCGTATG 


GCACCGGAAC 


CGGTAAGGAC 


GCGATCACCA 


GCGGCATCGA 


GGTCGTATGG 


60 


ACGAACACCC 


CGACGAAATG 


GGACAACAGT 


TTCCTCGAGA 


TCCTGTACGG 


CTACGAGTGG 


120 


GAGCTGACGA 


AGAGCCCTGC, 


TGGCGCTTGG 


CAATACACCG 


CCAAGGACGG 


CGCCGGTGCC 


180 


GGCACCATCC 


CGGACCCGTT 


CGGCGGGCCA 


GGGCGCTCCC 


CGACGATGCT 


GGCCACTGAC 


240 


CTCTCGCTGC 


GGGTGGATCC 


GATCTATGAG 


CGGATCACGC 


GTCGCTGGCT 


GGAACACCCC 


300 


GAGGAATTGG 


CCGACGAGTT 


CGCCAAGGCC 


TGGTACAAGC 


TGATCCACCG 


AGACATGGGT 


360 


CCCGTTGCGA 


GATACCTTGG 


GCCGCTGGTC 


CCCAAGCAGA 


CCCTGCTGTG 


GCAGGATCCG 


420 


GTCCCTGCGG 


TCAGCCACGA 


CCTCGTCGGC 


GAAGC CGAGA 


TTGCCAGCCT 


TAAGAGCCAG 


480 


ATCCTGGCAT 


CGGGATTGAC 


TGTCTCACAG 


CTAGTTTCGA 


CCGCATGGGC 


GGCGGCGTCG 


540 


TCGTTCCGTG 


GTAGCGACAA 


GCGCGGCGGC 


GCCAACGGTG 


GTCGCATCCG 


CCTGCAGCCA 


600 


CAAGTCGGGT 


GGGAGGTCAA 
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(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:146: 

AGCTCGTATG GCACCGGAAC CGGTAAGGAC GCGATCACCA CCGGCATCGA GGTCGTATGG 60 

ACGAACACCC CGACGAAATG GGACAACAGT TTCCTCGAGA TCCTGTACGG CTACGAGTGG 12 0 
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GAGCTGACGA AGAGCCCTGC TGGCGCTTGG CAATACACCG CCAAGGACGG CGCCGGTGCC 18 0 

GGCACCATCC CGGACCCGTT CGGCGGGCCA GGGCGCTCCC CGACGATGCT GGCCACTGAC 24 0 

CTCTCGCTGC GGGTGGATCC GATC TATGAG CGGATCACGC GTCGCTGGCT GGAACACCCC 3 00 

GAGGAATTGG CCGACGAGTT CGCCAAGGCC TGGTACAAGC TGATCCACCG AGACATGGGT 3 60 

CCCGTTGCGA GATACCTTGG GCCGCTGGTC CCCAAGCAGA CCCTGCTGTG GCAGGATCCG 42 0 

GTCCCTGCGG TCAGCCACGA CCTCGTCGGC GAAGCCGAGA TTGCCAGCCT TAAGAGC CAG 4 80 

ATCCTGGCAT CGGGATTGAC TGTCTCACAG CTAGTTTCGA CCGCATGGGC GGCGGCGTCG 54 0 

TCGTTCCGTG GTAGCGACAA GCGCGGCGGC GCCAACGGTG GTCGCATCCG CCTGCAGCCA 600 

CAAGTCGGGT GGGAGGTCAA 62 0 
(2) INFORMATION FOR SEQ ID NO: 14 7: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 



TTGACCTCCC 


ACCCGACTTG 


TGGCTGCAGG 


CGGATGCGAC 


CACCGTTGGC 


GCCGCCGCGC 


60 


TTGTCGCTAC 


CACGGAACGA 


CGACGCCGCC 


GCCCATGCGG 


TCGAAACTAG 


CTGTGAGACA 


120 


GTCAATCCCG 


ATGCCCGGAT 


CTGGCTCTTA 


AGGCTGGCAA 


TCTCGGCTTC 


GCCGACGAGG 


180 


TCGTGGCTGA 


CCGCAGGGAC 


CGGATC CTGC 


CACAGCAGGG 


TCTGCTTGGG 


GACCAGCGGC 


240 


CCAAGGTATC 


TCGCAACGGG 


ACCCATGTCT 


CGGTGGATCA 


GCTTGTACCA 


GGCCTTGGCG 


300 


AACTCGTCGG 


CCAATTCCTC 


GGGGTGTTCC 


AGCCAGCGAC 


GCGTGATCCG 


CTCATAGATC 


360 


GGATCCACCC 


GCAGCGAGAG 


GTCAGTGGCC 


AGCATCGTCG 


GGGAGCGCCC 


TGGCCCGCCG 


420 


AACGGGTCCG 


GGATGGTGCC 


GGCACCGGCG 


CCGTCCTTGG 


CGGTGTATTG 


CCAAGCGCCA 


480 


GCAGGGCTCT 


TCGTCAGCTC 


CCACTCGTAG 


CCGTACAGGA 


TCTCGAGGAA 


ACTGTTGTCC 


540 


CATTTCGTCG 


GGGTGTTCGT 


CCATACGACC 


TCGATGCCGC 


TGGTGATCGC 


GTCCTTACCG 


600 


GTTCCGGTGC 


CATACGAGCT 
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(2) INFORMATION FOR SEQ ID NO: 14 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 8: 

TTGACCTCCC ACCCGACTTG TGGCTGCAGG CGGATGCGAC CACCGTTGGC GCCGCCGCGC 60 

TTGTCGCTAC CACGGAACGA CGACGCCGCC GCCCATGCGG TCGAAACTAG CTGTGAGACA 12 0 

GTCAATCCCG ATGCCCGGAT CTGGCTCTTA AGGCTGGCAA TCTCGGCTTC GCCGACGAGG 180 

TCGTGGCTGA CCGCAGGGAC CGGATCCTGC CACAGCAGGG TCTGCTTGGG GACCAGCGGC 240 

CCAAGGTATC TCGCAACGGG ACCCATGTCT CGGTGGATCA GCTTGTACCA GGCCTTGGCG 3 00 

AACTCGTCGG CCAATTCCTC GGGGTGTTCC AGCCAGCGAC GCGTGATCCG CTCATAGATC 360 

GGATCCACCC GCAGCGAGAG GTCAGTGGCC AGCATCGTCG GGGAGCGCCC TGGCCCGCCG 420 

AACGGGTCCG GGATGGTGCC GGCACCGGCG CCGTCCTTGG CGGTGTATTG CCAAGCGCCA 4 80 

GCAGGGCTCT TCGTCAGCTC CCACTCGTAG CCGTACAGGA TCTCGAGGAA ACTGTTGTCC 54 0 

CATTTCGTCG GGGTGTTCGT CCATACGACC TCGATGCCGG TGGTGATCGC GTCCTTACCG 6 00 

GTTCCGGTGC CATACGAGCT 62 0 
(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 9: 

TTGACCTCCC ACCCGACTTG TGGCTGCAGG CGGATGCGAC CACCGTTGGC GCCGCCGCGC 6 0 

TTGTCGCTAC CACGGAACGA CGACGCCGCC GCCCATGCGG TCGAAACTAG CTGTGAGACA 12 0 

GTCAATCCCG ATGCCAGGAT CTGGCTCTTA AGGCTGGCAA TCTCGGCTTC GCCGACGAGG 180 

TCGTGGCTGA CCGCAGGGAC CGGATCCTGC CACAGCAGGG TCTGCTTGGG GACCAGCGGC 24 0 

CCAAGGTATC TCGCAACGGG ACCCATGTCT CGGTGGATCA GCTTGTACCA GGCCTTGGCG 30 0 

AACTCGTCGG CCAATTCCTC GGGGTGTTCC AGCCAGCGAC GCGTGATCCG CTCATAGATC 360 

GGATCCACCC GCAGCGAGAG GTCAGTGGCC AGCATCGTCG GGGAGCGCCC TGGCCCGCCG 42 0 

AACGGGTCCG GGATGGTGCC GGCACCGGCG CCGTCCTTGG CGGTGTATTG CCAAGCGCCA 480 

GCAGGGCTCT TCGTCAGCTC CCACTCGTAG CCGTACAGGA TCTCGAGGAA ACTGTTGTCC 540 

CATTTCGTCG GGGTGTTCGT CCATACGACC TCGATGCCGC TGGTGATCGC GTCCTTACCG 600 

GTTCCGGTGC CATACGAGCT 620 

(2) INFORMATION FOR SEQ ID NO: 15 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 0: 

TTGACCTCCC ACCCGACTTG TGGCTGCAGG CGGATGCGAC CACCGTTGGC GCCGCCGCGC 6 0 

TTGT CGCTAC CACGGAACGA CGACGCCGCC GCCCATGCGG TCGAAACTAG CTGTGAGACA 12 0 

GTCAATCCCG ATGCCAGGAT CTGGCTCTTA AGGCTGGCAA TCTCGGCTTC GCCGACGAGG 180 

TCGTGGCTGA CCGCAGGGAC CGGATCCTGC CACAGCAGGG TCTGCTTGGG GACCAGCGGC 240 

CCAAGGTATC TCGCAACGGG ACCCATGTCT CGGTGGATCA GCTTGTACCA GGCCTTGGCG 300 

AACTCGTCGG CCAATTCCTC GGGGTGTTCC AGCCAGCGAC GCGTGATCCG CTCATAGATC 3 60 

GGATCCACCC GCAGCGAGAG GTCAGTGGCC AGCATCGTCG GGGAGCGCCC TGGCCCGCCG 420 

AACGGGTCCG GGATGGTGCC GGCACCGGCG CCGTCCTTGG CGGTGTATTG CCAAGCGCCA 4 80 

GCAGGGCTCT TCGTCAGCTC CCACTCGTAG CCGTACAGGA TCTCGAGGAA ACTGTTGTCC 54 0 

CATTTCGTCG GGGTGTTCGT CCATACGACC TCGATGCCGG TGGTGATCGC GTCCTTACCG 600 

GTTCCGGTGC CATACGAGCT 620 
(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
AGAGTTTGAT CCTGGCTCAG 2 0 

(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:152: 

GGCGGACGGG TGAGTAA 17 

(2) INFORMATION FOR SEQ ID NO:153: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l! 

CTGCTGCCTC CCGTAGGAGT 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
ATGACGTCAA GTCATCATGG CCCTTACGA 
(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
GTACAAGGCC CGGGAACGTA TTCACCG 
(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
GCAACGAGCG CAACCC 

(2) INFORMATION FOR SEQ ID NO:157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 




* 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:157: 
ATGACGTCAA GTCATCATGG CCCTTA 
(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 
AAATTGAAGA GTTTGATCAT GGCTCAGATT GAACGCTGGC GGCAGGCCTA ACACATGCAA 
GTCGAACGGT AACAGGAAGA AGCTTGCTTC TTTGCTGACG AGTGGCGGAC GGGTGAGTAA 
TGTCTGGGAA ACTGCCTGAT GGAGGGGGAT AACTACTGGA AACGGTAGCT AATACCGCAT 
AACGTCGCAA GACCAAAGAG GGGGAC CTTC GGGCCTCTTG CCATCGGATG TGCCCAGATG 
GGATTAGCTA GTAGGTGGGG TAACGGCTCA CCTAGGCGAC GATCCCTAGC TGGTCTGAGA 
GGATGACCAG CCACACTGGA ACTGAGACAC GGTCCAGACT CCTACGGGAG GCAGCAGTGG 
GGAATATTGC ACAATGGGCG CAAGCCTGAT GCAGCCATGC CGCGTGTATG AAGAAGGCCT 
TCGGGTTGTA AAGTACTTTC AGCGGGGAGG AAGGGAGTAA AGTTAATACC TTTGCTCATT 
GACGTTACCC GCAGAAGAAG CACCGGCTAA CTCCGTGCCA GCAGCCGCGG TAAT AC GGAG 
GGTGCAAGCG TTAATCGGAA TTACTGGGCG TAAAGCGCAC GCAGGCGGTT TGTTAAGTCA 
GATGTGAAAT CCCCGGGCTC AACCTGGGAA CTGCATCTGA TACTGGCAAG CTTGAGTCTC 
GTAGAGGGGG GTAGAATTCC AGGTGTAGCG GTGAAATGCG TAGAGATCTG GAGGAATACC 
GGTGGCGAAG GCGGCCCCCT GGACGAAGAC TGACGCTCAG GTGC GAAAGC GTGGGGAGCA 
AACAGGATTA GATACCCTGG TAGTCCACGC CGTAAACGAT GTCGACTTGG AGGTTGTGCC 
CTTGAGGCGT GGCTTCCGGA GCTAACGCGT TAAGTCGACC GCCTGGGGAG TACGGCCGCA 
AGGTTAAAAC TCAAATGAAT TGACGGGGGC CCGCACAAGC GGTGGAGCAT GTGGTTTAAT 
TCGATGCAAC GCGAAGAACC TTACCTGGTC TTGACATC C A CGGAAGTTTT CAGAGATGAG 
AATGTGCCTT CGGGAACCGT GAGACAGGTG CTGCATGGCT GTCGTCAGCT CGTGTTGTGA 
AATGTTGGGT TAAGTCCCGC AACGAGCGCA ACCCTTATCC TTTGTTGCCA GCGGTC CGGC 
CGGGAACTCA AAGGAGACTG CCAGTGATAA ACTGGAGGAA GGTGGGGATG ACGTCAAGTC 
ATCATGGCCC TTACGACCAG GGCTACACAC GTGCTACAAT GGCGCATACA AAGAGAAGCG 
ACCTCGCGAG AGCAAGCGGA CCTCATAAAG TGCGTCGTAG TCCGGATTGG AGTCTGCAAC 
TCGACTCCAT GAAGTCGGAA TCGCTAGTAA TCGTGGATCA GAATGCCACG GTGAATACGT 
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TCCCGGGCCT TGTACACACC GCCCGTCACA CCATGGGAGT GGGTTGCAAA AGAAGTAGGT 144 0 

AGCTTAACCT TCGGGAGGGC GCTTACCACT TTGTGATTCA TGACTGGGGT GAAGTCGTAA 150 0 

CAAGGTAACC GTAGGGGAAC CTGCGGTTGG ATCACCTCCT TA 1542 
(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1513 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 



TTTTTATGGA 


GAGTTTGATC 


CTGGCTCAGA 


GTGAACGCTG 


GCGGCGTGCC 


TAATACATGC 


60 


AAGTCGAACG 


ATGAAGCTTC 


TAGCTTGCTA 


GAAGTGGATT 


AGTGGCGCAC 


GGGTGAC5TAA 


12 0 


GGTATAGTTA 


ATCTGCCCTA 


CACAAGAGGA 


CAACAGTTGG 


AAACGACTGC 


TAATACTCTA 


180 


TACTCCTGCT 


TAACACAAGT 


TGAGTAGGGA 


AAGTTTTTCG 


GTGTAGGATO 


ARAfTATATA 


24 0 


GTATCAGCTA 


GTTGGTAAGG 


TAATGGCTTA 


CCAAGGCTAT 


GACGCTTAAC 


x x x unvjn 


3 0 0 


GGATGATCAG 


TCACACTGGA 


AC TGAGAC AC 


GGTCCAGACT 

V3VJ X v-' v_ OVJiiV- X 


V— >— x -£i.v_ \j\j\DJrwj 


RrAOPARTAR 

VJV .cWjy^, fWj X £\\J 


360 


GGAATATTGC 


GCAATGGGGG 


AAACCCTGAC 


GCAGCAACOr 


V — V_I V_ \_J J. VJ\JJ-1VJ\J 


ATOAPATTTT 


420 


TCGGAGCGTA 


AACTCCTTTT 


CTTAGGGAAG 


AATTCTGACG 


vj X ^l^w v X rUluu 




T O \J 


GGCTAACTCC 


GTGCCAGCAG 


CCGCGGTAAT 




\^-^LC\\JV- \J X J. 




Zj *± u 


T GGGCGT AAA 


GGGCGCGTAG 


GCGGATTATC 


AAGTCTCTTG 


TGAAATCTAA 


TGGCTTAACC 


600 


ATTAAACTGC 


TTGGGAAACT 


GATAGTCTAG 


AGT GAGGGAG 


AGGCAGATGG 


AATTGGTGGT 


660 


GTAGGGGTAA 


AATCCGTAGA 


TAT C ACCAAG 


AATACCCATT 


GCGAAGGCGA 


TCTGCTGGAA 


720 


CTCAACTGAC 


GCTAAGGCGC 


GAAAGCGTGG 


GGAGCAAACA 


GGATTAGATA 


CCCTGGTAGT 


780 


CCACGCCCTA 


AACGATGTAC 


ACTAGTTGTT 


GGGGTGCTAG 


TCATCTCAGT 


AATGCAGCTA 


840 


ACGCATTAAG 


TGTACCGCCT 


GGGGAGTACG 


GTCGCAAGAT 


TAAAACTCAA 


AGGAATAGAC 


900 


GGGGACCCGC 


ACAAGCGGTG 


GAGCATGTGG 


TTTAATTCGA 


AGATACGCGA 


AGAACCTTAC 


960 


CTGGGC TTGA 


TATCCTAAGA 


ACCTTTTAGA 


GATAAGAGGG 


TGCTAGCTTG 


CTAGAACTTA 


1020 


GAGACAGGTG 


CTGCACGGCT 


GTCGTCAGCT 


CGTGTCGTGA 


GATGTTGGGT 


TAAGTCCCGC 


1080 


AACGAGCGCA 


ACCCACGTAT 


TTAGTTGC TA 


ACGGTTCGGC 


CGAGCACTCT 


AAATAGACTG 


1140 


CCTTCGTAAG 


GAGGAGGAAG 


GTGTGGACGA 


CGTCAAGTCA 


TCATGGCCCT 


TATGCCCAGG 


1200 


GCGACACACG 


TGCTACAATG 


GCATATAGAA 


TGAGACGCAA 


TACCGCGAGG 


TGGAGCAAAT 


1260 


CTATAAAATA 


TGTCCCAGTT 


CGGATTGTTC 


TCTGCAACTC 


GAGAGCATGA 


AGCCGGAATC 


1320 
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GCTAGTAATC GTAGATCAGC CATGCTACGG TGAATACGTT CCCGGGTCTT GTACTCACCG 
CCCGTCACAC CATGGGAGTT GATTTCACTC GAAGC CGGAA TACTAAACTA GTTACCGTCC 
ACAGTGGAAT CAGCGACTGG GGTGAAGTCG TAACAAGGTA ACCGTAGGAG AACC TGCGGT 
TGGATCACCT CCT 

(2) INFORMATION FOR SEQ ID NO:160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
TTTTATGGAG AGTTTGATCC TGGCTCAGGA TGAACGCTGG CGGCGTGCCT AATACATGCA 
AGTCGAGCGA ACGGACGAGA AGCTTGCTTC TCTGATGTTA GCGGCGGACG GGTGAGTAAC 
ACGTGGATAA CCTACCTATA AGACTGGGAT AACTTCGGGA AACCGGAGCT AATACCGGAT 
AATATTTTGA ACCGCATGGT TCAAAAGTGA AAGACGGTCT TGCTGTCACT TATAGATGGA 
TCCGCGCTGC ATTAGCTAGT TGGTAAGGTA ACGGCTTACC AAGGCAACGA TACGTAGCCG 
ACCTGAGAGG GTGATCGGCC ACACTGGAAC TGAGACACGG TCCAGACTCC TACGGGAGGC 
AGCAGTAGGG AATCTTCCGC AATGGGCGAA AGCCTGACGG AGCAACGCCG CGTGAGTGAT 
GAAGGTCTTC GGATCGTAAA ACTCTGTTAT TAGGGAAGAA CATATGTGTA AGTAACTGTG 
CACATCTTGA CGGTACCTAA TCAGAAAGCC ACGGCTAACT ACGTGCCAGC AGCCGCGGTA 
ATACGTAGGT GGCAAGCGTT ATCCGGAATT ATTGGGCGTA AAGCGCGCGT AGGCGGTTTT 
TTAAGTCTGA TGTGAAAGCC CACGGCTCAA CCGTGGAGGG T C ATTGG AAA CTGGAAAACT 
TGAGTGCAGA AGAGGAAAGT GGAATTCCAT GTGTAGCGGT GAAATGCGCA GAGATATGGA 
GGAAC AC C AG TGGCGAAGGC GACTTTCTGG TCTGTAACTG ACGCTGATGT GCGAAAGCGT 
GGGGATCAAA CAGGATTAGA TACCCTGGTA GTCCACGCCG TAAACGATGA GTGCTAAGTG 
TTAGGGGGTT TCCGCCCCTT AGTGCTGCAG CTAACGCATT AAGCACTCCG CCTGGGGAGT 
ACGACCGCAA GGTTGAAACT CAAAGGAATT GACGGGGACC CGCACAAGCG GTGGAGCATG 
TGGTTTAATT CGAAGCAACG CGAAGAACCT TACCAAATCT TGACATCCTT TGACAACTCT 
AGAGATAGAG CCTTCCCCTT CGGGGGACAA AGTGACAGGT GGTGCATGGT TGTCGTCAGC 
TCGTGTCGTG AGATGTTGGG TTAAGTCCCG CAACGAGCGC AACCCTTAAG CTTAGTTGCC 
ATCATTAAGT TGGGCACTCT AAGTTGACTG CCGGTGACAA ACCGGAGGAA GGTGGGGATG 
ACGTCAAATC ATCATGCCCC TTATGATTTG GGCTACACAC GTGCTACAAT GGACAATACA 
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AAGGGCAGCG AAACCGCGAG GTCAAGCAAA TCCCATAAAG TTGTTCTCAG TTCGGATTGT 1320 

AGTCTGCAAC TCGACTACAT GAAGCTGGAA TCGCTAGTAA TCGTAGATCA GCATGCTACG 13 80 

GTGAATACGT TCCCGGGTAT TGTACACACC GCCCGTCACA CCACGAGAGT TTGTAAC AC C 1440 

CGAAGC CGGT GGAGTAACCT TTTAGGAGCT AGCCGTCGAA GGTGGGACAA ATGATTGGGG 1500 

TGAAGTCGTA ACAAGGTAGC CGTATCGGAA GGTGCGGCTG GATCACCTCC TTTCT 15 55 
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